or 5') to end (C-terminus or 3'), such that for an alignment 
that extends to p monomers (where p>x) there are p-x+1 such windows, each window has 
at least xy identical aligned monomers, where: x is selected from 20, 25, 30, 35, 40, 45, 50, 
60, 70, 80, 90, 100, 150, 200; y is selected from 0.50, 0.60, 0.70, 0.75, 0.80, 0.85, 0.90, 
0.91, 0.92, 0.93, 0.94, 0.95, 0.96, 0.97, 0.98, 0.99; and if xy is not an integer then it is 
rounded up to the nearest integer. The preferred pairwise alignment algorithm is the 
Needleman-Wunsch global alignment algorithm [Needleman & Wunsch (1970) J. Mol. 
Biol. 48, 443-453], using default parameters (e.g., with Gap opening penalty = 10.0, and 
with Gap extension penalty = 0.5, using the EBLOSUM62 scoring matrix). This 
algorithm is conveniently implemented in the needle tool in the EMBOSS package [Rice 
et al. (2000) Trends Genet. 16:276-277]. 
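
The moving-window criterion above can be illustrated with a minimal Python sketch. It assumes the two sequences have already been globally aligned (for example with the needle tool) and are supplied as equal-length strings in which "-" marks a gap. The function names, the gap character, and the choice to count p over alignment columns are assumptions made for illustration only and are not taken from the specification.

```python
import math


def window_threshold(x: int, y: float) -> int:
    """Minimum number of identical aligned monomers per window.

    Computes x*y and rounds up when it is not an integer. The inner round()
    only guards against floating-point noise (e.g. 18.999999999 for 19).
    """
    return math.ceil(round(x * y, 9))


def satisfies_window_identity(aligned_a: str, aligned_b: str,
                              x: int, y: float, gap: str = "-") -> bool:
    """Check that every window of x columns has at least x*y identities.

    aligned_a / aligned_b: rows of a pairwise alignment (hypothetical input
    format: equal-length strings, with `gap` marking gap positions).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    p = len(aligned_a)
    if p <= x:
        raise ValueError("alignment must extend beyond x monomers (p > x)")

    need = window_threshold(x, y)
    # 1 where both rows carry the same monomer (gap columns never count)
    same = [int(a == b and a != gap) for a, b in zip(aligned_a, aligned_b)]

    count = sum(same[:x])              # first window, columns 0 .. x-1
    if count < need:
        return False
    for start in range(1, p - x + 1):  # remaining p-x windows
        count += same[start + x - 1] - same[start - 1]
        if count < need:
            return False
    return True
```

With x = 20 and y = 0.95 the threshold is 19, so a single mismatch in a 30-column alignment passes, whereas two mismatches that fall inside the same 20-column window fail.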

The nucleic acids and polypeptides of the invention may additionally have further sequences to 
the N-terminus/5' and/or C-terminus/3' of these sequences (a) to (d). 

All of the Gram positive bacterial sequences referenced herein are publicly available through 
PubMed on GenBank. 

Streptococcus pneumoniae Adhesin Island Sequences 

As discussed above, a S. pneumoniae AI sequence is present in the TIGR4 S. pneumoniae 
genome. Examples of S. pneumoniae AI sequences are set forth below. 

SrtD (Sp0468) is a sortase. An example of an amino acid sequence of SrtD is set forth in 
SEQ ID NO: 80. 
SEQ ID NO: 80 

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

SrtC (Sp0467) is a sortase. An example of an amino acid sequence of SrtC is set forth in SEQ 
ID NO: 81. 

SEQ ID NO: 81 

MSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAFNATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIG 
YVEIPAIDQEIPMYVGTSEDILQKGAGLLEGASLPVGGENTHTVITAHRGLPTAELFSQLDKMKKGDIFYLHVLD 
QVLAYQVDQIVTVEPNDFEPVLIQHGEDYATLLTCTPYMINSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWL 
LLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD 

SrtB (SP0466) is a sortase. An example of an amino acid sequence of SrtB is set forth in 
SEQ ID NO: 82. 
SEQ ID NO: 82 

MAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDSLNNVVSGDPWSEEMKKKGRAEYARM 
LEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTHAVITAHTGLPTAKMFTDLTKLKVGD 
KFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINTHRLLVRGHRIPYVAEVEEEFIAANK 
LSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVEDGQQ 

Sp0465 is a hypothetical protein. An example of an amino acid sequence of Sp0465 is set 
forth in SEQ ID NO: 83. 
SEQ ID NO: 83 

MFLPFLSASLYLQTHHFIAFPNRQSYLLRETRKSHFFLIHHPF 

RrgC (SP0464) is a cell wall surface anchor family protein. RrgC contains a sortase substrate 
motif VPXTG (SEQ ID NO: 137), shown in italics in SEQ ID NO: 84. 
SEQ ID NO: 84 

MISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRVQIVRDLHS 
WDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTMTTK 
VKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEP 
LAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVV 
TSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLYILML 
VAILLFGSGYYLTKKPNN 

RrgB (Sp0463) is a cell wall surface anchor protein. RrgB contains a sortase substrate motif 
IPXTG (SEQ ID NO: 133), shown in italics in SEQ ID NO: 85. 
SEQ ID NO: 85 

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKE 
IAGVMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTY 
VGEDGATLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIV 
TKIPALANYATANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKI 
TYSATLNDKAIVEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTG 
KVVQTVTLTTDKNTVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKK 
FVKVNDKDNRLAGAEFVIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDK 
AQAAYNAAVIAANNAFEWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSY 
SATGQGIEYTAGSGKDDATKVVNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

RrgA (Sp0462) is a cell wall surface anchor protein. RrgA contains a sortase substrate motif 
YPXTG (SEQ ID NO: 186), indicated in italics in SEQ ID NO: 86. 
SEQ ID NO: 86 

MLNRETHMKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDG 
TTVSQRTEAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGT 
YPDVQTPYQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPL 
DVVILLDNSNSMSNIRNKNARRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLN 
DSLFWNYDQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQAR 
QNSQKVIFHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQM 
FTDKTVYEKGAPAAFPVKPEKYSEMKAAGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNG 
NIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITD 
PMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVT 
LTYNVRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKIRDVRKYPEITISKEKKLGDIEFIKVNKNDKKP 
LRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVN 
GEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

RlrA (Sp0461) is a transcriptional regulator. An example of an amino acid sequence for RlrA 
is set forth in SEQ ID NO: 87. 
SEQ ID NO: 87 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

As discussed above, a S. pneumoniae AI sequence is present in the S. pneumoniae strain 670 
genome. Examples of S. pneumoniae AI sequences are set forth below. 

An example of an amino acid sequence of orf1_670 is set forth in SEQ ID NO: 171. 
SEQ ID NO: 171 

MEHINHTTLLIGIKDKNITLNKAIQHDTHIEVFATLDYHPPKCKHCKGKQIKYDFQKPSKIPFIEIGGFPSLIHL 
KKRRFQCKSCRKVTVAETTLVQKNCQISEMVRQKIAQLLLNREALTHIASKLAISTSTSTVYRKLKQFHFQEDYT 
TLPEILSWDEFSYQKGKLAFIAQDFNTKKIMTILDNRRQTTIRNHFFKYSKEARKKVKVVTVDMSGSYIPLIKKL 
FPNAKIVLDRFHIVQHMSRALNQTRINIMKQFDDKSLEYRALKYYWKFILKDSRKLSLKPFYARTFRETLTPREC 
LKKIFTLVPELKDYYDLYQLLLFHLQEKNTDQFWGLIQDTLPHLNRTFKTTLSTFICYKNYITNAIELPYSNAKL 
EATNKLIKDIKRNAFGFRNFENFKKRIFIALNIKKERTKFVLSRA 

Orf2_670 is a transcriptional regulator. An example of an amino acid sequence of Orf2_670 
is set forth in SEQ ID NO: 172. 
SEQ ID NO: 172 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

Orf3_670 is a cell wall surface anchor family protein. An example of an amino acid sequence 
of orf3_670 is set forth in SEQ ID NO: 173. 
SEQ ID NO: 173 

MLNRETHMKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDG 
TTVSQRTEAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGT 
YPDVQTPYQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPL 
DVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILN 
DSALWTFDRTTFTAKTYNYSFLNLTSDPTDXQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQAR 
PNSKKVIFHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQM 
FTKKPVTDQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMA 
QDGYDVFTVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMG 
ELIDFQLGADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTY 
NVRLNDQFVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRD 
AVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEV 
RDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

Orf4_670 is a cell wall surface anchor family protein. An example of an amino acid sequence 
of orf4_670 is set forth in SEQ ID NO: 174. 
SEQ ID NO: 174 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYXAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY 
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

Orf5_670 is a cell wall surface anchor family protein. An example of an amino acid sequence 
of orf5_670 is set forth in SEQ ID NO: 175. 
SEQ ID NO: 175 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKPNN 

Orf6_670 is a sortase. An example of an amino acid sequence of orf6_670 is set forth in SEQ 
ID NO: 176. 
SEQ ID NO: 176 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

Orf7_670 is a sortase. An example of an amino acid sequence of orf7_670 is set forth in SEQ 
ID NO: 177. 
SEQ ID NO: 177 

VSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAFNATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIG 
YVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGENTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLD 
QVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYMINSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWL 
LLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

Orf8_670 is a sortase. An example of an amino acid sequence of orf8_670 is set forth in SEQ 
ID NO: 178. 
SEQ ID NO: 178 

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 19A Hungary 6 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 19A Hungary 6 are set forth 
below. 

ORF2_19AH is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_19AH is set forth in SEQ ID NO: 187. 
SEQ ID NO: 187 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSXILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_19AH is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_19AH is set forth in SEQ ID NO: 188. 
SEQ ID NO: 188 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT 
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL 
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV 
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKNP 

ORF4_19AH is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_19AH is set forth in SEQ ID NO: 189. 
SEQ ID NO: 189 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGXNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY 
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

ORF5_19AH is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_19AH is set forth in SEQ ID NO: 190. 
SEQ ID NO: 190 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK 
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKPNN 

ORF6_19AH is a putative sortase. An example of an amino acid sequence of ORF6_19AH is 
set forth in SEQ ID NO: 191. 
SEQ ID NO: 191 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_19AH is a putative sortase. An example of an amino acid sequence of ORF7_19AH is 
set forth in SEQ ID NO: 192. 
SEQ ID NO: 192 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM 
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_19AH is a putative sortase. An example of an amino acid sequence of ORF8_19AH is 
set forth in SEQ ID NO: 193. 
SEQ ID NO: 193 

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFMGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 6B Finland 12 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 6B Finland 12 are set forth 
below. 

ORF2_6BF is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_6BF is set forth in SEQ ID NO: 194. 
SEQ ID NO: 194 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_6BF is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_6BF is set forth in SEQ ID NO: 195. 
SEQ ID NO: 195 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT 
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF 
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL 
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV 
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

ORF4_6BF is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_6BF is set forth in SEQ ID NO: 196. 
SEQ ID NO: 196 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY 
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

ORF5_6BF is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_6BF is set forth in SEQ ID NO: 197. 
SEQ ID NO: 197 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK 
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKPNN 

ORF6_6BF is a putative sortase. An example of an amino acid sequence of ORF6_6BF is set 
forth in SEQ ID NO: 198. 
SEQ ID NO: 198 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_6BF is a putative sortase. An example of an amino acid sequence of ORF7_6BF is set 
forth in SEQ ID NO: 199. 
SEQ ID NO: 199 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM 
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_6BF is a putative sortase. An example of an amino acid sequence of ORF8_6BF is set 
forth in SEQ ID NO: 200. 
SEQ ID NO: 200 

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 6B Spain 2 S. pneumoniae 
genome. Examples of S. pneumoniae AI sequences from 6B Spain 2 are set forth below. 

ORF2_6BSP is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_6BSP is set forth in SEQ ID NO: 201. 
SEQ ID NO: 201 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_6BSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_6BSP is set forth in SEQ ID NO: 202. 
SEQ ID NO: 202 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT 
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF 
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL 
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV 
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

ORF4_6BSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_6BSP is set forth in SEQ ID NO: 203. 
SEQ ID NO: 203 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY 
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

ORF5_6BSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_6BSP is set forth in SEQ ID NO: 204. 
SEQ ID NO: 204 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK 
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKPNN 

ORF6_6BSP is a putative sortase. An example of an amino acid sequence of ORF6_6BSP is 
set forth in SEQ ID NO: 205. 
SEQ ID NO: 205 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_6BSP is a putative sortase. An example of an amino acid sequence of ORF7_6BSP is 
set forth in SEQ ID NO: 206. 
SEQ ID NO: 206 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM 
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_6BSP is a putative sortase. An example of an amino acid sequence of ORF8_6BSP is 
set forth in SEQ ID NO: 207. 
SEQ ID NO: 207 

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 9V Spain 3 S. pneumoniae 
genome. Examples of S. pneumoniae AI sequences from 9V Spain 3 are set forth below. 

ORF2_9VSP is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_9VSP is set forth in SEQ ID NO: 208. 
SEQ ID NO: 208 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_9VSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_9VSP is set forth in SEQ ID NO: 209. 
SEQ ID NO: 209 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTNGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQRTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYERKDKSVPLDVVILLD 
NSNSMSNIRNKNARRAERAGEATRSLIDKITSDPENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNY 
DQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVI 
FHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVY 
EKGAPAAFPVKPEKYSEMKAVGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGY 
DVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELID 
LQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRL 
NDQFVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPAITIAKEKKLGEIEFIKINKNDKKPLRDAVFS 
LQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVT 
SIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLLFYLIGCMMMGGVLLYTRKHP 

ORF4_9VSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_9VSP is set forth in SEQ ID NO: 210. 
SEQ ID NO: 210 

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKE 
IAGVMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTY 
VGEDGATLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIV 
TKIPALANYATANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQWAEKTVKI 
TYSATLNDKAIVEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTG 
KVVQTVTLTTDKNTVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKK 
FVKVNDKDNRLAGAEFVIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDK 
AQAAYNAAVIAANNAFEWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSY 
SATGQGIEYTAGSGKDDATKVVNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA 

ORF5_9VSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_9VSP is set forth in SEQ ID NO: 211. 
SEQ ID NO: 211 

MTMQKMQKMQKMQKMQKMQKMI ^ 

KLDDSYSYDNRVQIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMT 
DQTVEPLVIVAKKADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKN 
GEIVVTNLPLGTYRFKEVEPLAGYTVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKV 
MKEENGHYTPVLQNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNN 
KRPRIDVPDTGEETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_9VSP is a putative sortase. An example of an amino acid sequence of ORF6_9VSP is 
set forth in SEQ ID NO: 212. 
SEQ ID NO: 212 

MLIKMAKTKKQKRNNLLLGVVFFIGIAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPAIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKRQSERALKALKEATKEVKVE 
DE 

ORF7_9VSP is a putative sortase. An example of an amino acid sequence of ORF7_9VSP is 
set forth in SEQ ID NO: 213. 
SEQ ID NO: 213 

MSKSRYSRKKSVKKKKNPFILLLIFLVGLAVAMYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTWTAHRGLPTAELFSQLDKMKKGDIFYLHVLDQVLAYQVDQIVTVEPNDFEPVLIQHGEDYATLLTCTPYM 
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD 

ORF8_9VSP is a putative sortase. An example of an amino acid sequence of ORF8_9VSP is 
set forth in SEQ ID NO: 214. 
SEQ ID NO: 214 

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 14 CSR 10 S. pneumoniae 
genome. Examples of S. pneumoniae AI sequences from 14 CSR 10 are set forth below. 

ORF2_14CSR is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_14CSR is set forth in SEQ ID NO: 215. 
SEQ ID NO: 215 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALXFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_14CSR is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_14CSR is set forth in SEQ ID NO: 216. 
SEQ ID NO: 216 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT 
DQYGVHQXLSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF 
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL 
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV 
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

ORF4_14CSR is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_14CSR is set forth in SEQ ID NO: 217.
SEQ ID NO: 217 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE
NAQFWKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY

VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIPCAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA

ORF5_14CSR is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_14CSR is set forth in SEQ ID NO: 218. 
SEQ ID NO: 218 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTXGKDTRKELVTVVKNNKRPRIDVPDTGE
ETLYILMLVAILLFGSGYYLTKKPNN

ORF6_14CSR is a putative sortase. An example of an amino acid sequence of ORF6_14CSR 
is set forth in SEQ ID NO: 219.
SEQ ID NO: 219 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE
DGQQ 

ORF7_14CSR is a putative sortase. An example of an amino acid sequence of ORF7_14CSR
is set forth in SEQ ID NO: 220.
SEQ ID NO: 220 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_14CSR is a putative sortase. An example of an amino acid sequence of ORF8_14CSR
is set forth in SEQ ID NO: 221.
SEQ ID NO: 221

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLWLAFLGILFVLWKLARLLRGK

As discussed above, a S. pneumoniae AI sequence is present in the 19F Taiwan 14 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 19F Taiwan 14 are set forth 
below. 

ORF2_19FTW is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_19FTW is set forth in SEQ ID NO: 222.
SEQ ID NO: 222 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL

ORF3_19FTW is a cell wall surface protein. An example of an amino acid sequence of
ORF3_19FTW is set forth in SEQ ID NO: 223. 
SEQ ID NO: 223 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYERKDKSVPLDVVILLD
NSNSMSNIRNKNARRAERAGEATRSLIDKITSDPENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNY
DQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVI
FHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVY
EKGAPAAFPVKPEKYSEMKAVGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGAPTRWYYNGNIAPDGY
DVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELID
LQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRL
NDQFVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPAITIAKEKKLGEIEFIKINKNDKKPLRDAVFS
LQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVT
SIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP

ORF4_19FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_19FTW is set forth in SEQ ID NO: 224.
SEQ ID NO: 224

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKE
IAGVMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTY 
VGEDGATLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIV 
TKIPALANYATANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKI 
TYSATLNDKAIVEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTG
KVVQTVTIiTTDKNTVTVNGLDKNTEYKFVERSIKGYSADYQEI.TTAGEIAVKNWKDENPKPLDPTEPKWTYGKK
FVKVNDKDNRLAGAEFVIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDK 
AQAAYNAAVIAANNAFEWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSY 
SATGQGIEYTAGSGKDDATKVVNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVECNNKDEDQLA 

ORF5_19FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_19FTW is set forth in SEQ ID NO: 225. 
SEQ ID NO: 225 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDNRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIVVTNLPLGT 
YRFKEVEPLAGYTVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEENGHYTPVL 
QNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_19FTW is a putative sortase. An example of an amino acid sequence of 
ORF6_19FTW is set forth in SEQ ID NO: 226. 
SEQ ID NO: 226 

MLIKMAKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPAIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKRQSERALKALKEATKEVKVE 
DE 

ORF7_19FTW is a putative sortase. An example of an amino acid sequence of
ORF7_19FTW is set forth in SEQ ID NO: 227.
SEQ ID NO: 227 

MSKSRYSRKKSVKKKKNPFILLLIFLVGLAVAMYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTDQEKKQGVSEYANMLKVHERIGYVEIPAIEQEIPMYVGTSEDILQKGAGLLEGASLPVGGE
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD

ORF8_19FTW is a putative sortase. An example of an amino acid sequence of 
ORF8_19FTW is set forth in SEQ ID NO: 228.
SEQ ID NO: 228 

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

As discussed above, a S. pneumoniae AI sequence is present in the 23F Taiwan 15 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 23F Taiwan 15 are set forth 
below. 

ORF2_23FTW is a transcriptional regulator. An example of an amino acid sequence of
ORF2_23FTW is set forth in SEQ ID NO: 229.
SEQ ID NO: 229 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL

ORF3_23FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_23FTW is set forth in SEQ ID NO: 230. 
SEQ ID NO: 230 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGXTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLD 
NSNSMSNIRNKNARRAERAGEATRSLIDKITSDPENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNY 
DQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVI
FHITDGVPTMSYPXNFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVY 
EKGAPAAFPVKPEKYSEMKAAGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGY
DVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELID 
LQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYNVRL 
NDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKIRDVRKYPEITISKEKKLGDIEFIKVNKNDKKPLRDAVFS 
LQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVT 
SIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP

ORF4_23FTW is a cell wall surface protein. An example of an amino acid sequence of
ORF4_23FTW is set forth in SEQ ID NO: 231. 
SEQ ID NO: 231 

MKSINKFLTILAALLLTVSSLFSAATVFAAEQKTKTLTVHKLLMTDQELDAWNSDAXTTAGYDGSQNFEQFKQLQ
GVPQGVTEISGVAFELQSYTGPQGKEQENLTNDAVWTAVNKGVTTETGVKFDTEVLQGTYRLVEVRKESTYVGPN
GKVLTGMKAVPALITLPLVNQNGVVENAHVYPKNSEDKPTATKTFDTAAGFVDPGEKGLAIGTKVPYIVTTTIPK
NSTLATAFWSDEMTEGLDYNGDVVVNYNGQPLDNSHYTLEAGHNGFILKLNEKGLEAINGKDAEATITLKYTATL
NALAVADVPEANDVTFHYGNNPGHGNTPKPNKPKNGELTITKTWADAKDAPIAGVEVTFDLVNAQTGEVVKVPGH
ETGIVLNQTNNWTFTATGLDNNTEYKFVERTIKGYSADYQTITETGKIAVKNWKDENPEPINPEEPRVKTYGKKF
VKVDQKDERLKEAQFVVKNEQGKYLALKSAAQQAVNEKAAAEAKQALDTAIAAYTNAADKNAAQAVVDAAQKTYN
DNYRAARFGYVEVERKEDALVLTSNTDGQFQISGLAAGSYTLEETKAPEGFAKLGDVKFEVGAGSWNQGDFNYLK
DVQKNDATKVVNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA

ORF5_23FTW is a cell wall surface protein. An example of an amino acid sequence of
ORF5_23FTW is set forth in SEQ ID NO: 232.
SEQ ID NO: 232 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDNRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIVVTNLPLGT 
YRFKEVEPLAGYTVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEENGHYTPVL 
QNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_23FTW is a putative sortase. An example of an amino acid sequence of
ORF6_23FTW is set forth in SEQ ID NO: 233.
SEQ ID NO: 233 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGQLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_23FTW is a putative sortase. An example of an amino acid sequence of
ORF7_23FTW is set forth in SEQ ID NO: 234.
SEQ ID NO: 234 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGKDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_23FTW is a putative sortase. An example of an amino acid sequence of 
ORF8_23FTW is set forth in SEQ ID NO: 235. 
SEQ ID NO: 235

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 23F Poland 16 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 23F Poland 16 are set forth 
below. 

ORF2_23FP is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_23FP is set forth in SEQ ID NO: 236.
SEQ ID NO: 236 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_23FP is a cell wall surface protein. An example of an amino acid sequence of
ORF3_23FP is set forth in SEQ ID NO: 237.
SEQ ID NO: 237

EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKNP

ORF4_23FP is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_23FP is set forth in SEQ ID NO: 238.
SEQ ID NO: 238

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGINGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINLEEPRVKTYGKKFVKVDQKDTRLE
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV
VNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA

ORF5_23FP is a cell wall surface protein. An example of an amino acid sequence of
ORF5_23FP is set forth in SEQ ID NO: 239. 
SEQ ID NO: 239 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDNRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIVVTNLPLGT 
YRFKEVEPLAGYAVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEENGHYTPVL 
QNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_23FP is a putative sortase. An example of an amino acid sequence of ORF6_23FP is 
set forth in SEQ ID NO: 240. 
SEQ ID NO: 240 

MLIKMAKTKKQKRNNLLLGVVFFIGIAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPAIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKRQSERALKALKEATKEVKVE
DE 

ORF7_23FP is a putative sortase. An example of an amino acid sequence of ORF7_23FP is
set forth in SEQ ID NO: 241.
SEQ ID NO: 241 

MSKSRYSRKKSVKKKKNPFILLLIFLVGLAVAMYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDIFYLHVLDQVLAYQVDQIVTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD 

ORF8_23FP is a putative sortase. An example of an amino acid sequence of ORF8_23FP is 
set forth in SEQ ID NO: 242.
SEQ ID NO: 242

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

Immunogenic compositions of the invention comprising AI antigens may further comprise 
one or more antigenic agents. Preferred antigens include those listed below. Additionally, the 
compositions of the present invention may be used to treat or prevent infections caused by any of the 
below-listed microbes. Antigens for use in the immunogenic compositions include, but are not 
limited to, one or more of the following set forth below, or antigens derived from one or more of the 
following set forth below: 
Bacterial Antigens 

N. meningitidis: a protein antigen from N. meningitidis serogroup A, C, W135, Y, and/or B
(1-7); an outer-membrane vesicle (OMV) preparation from N. meningitidis serogroup B (8, 9, 10,
11); a saccharide antigen, including LPS, from N. meningitidis serogroup A, B, C, W135 and/or Y,
such as the oligosaccharide from serogroup C (see PCT/US99/09346; PCT/IB98/01665; and
PCT/IB99/00103);

Streptococcus pneumoniae: a saccharide or protein antigen, particularly a saccharide from 
Streptococcus pneumoniae;

Streptococcus agalactiae: particularly, Group B streptococcus antigens;

Streptococcus pyogenes: particularly, Group A streptococcus antigens;

Enterococcus faecalis or Enterococcus faecium: particularly a trisaccharide repeat or other
Enterococcus-derived antigens provided in US Patent No. 6,756,361;

Helicobacter pylori: including: Cag, Vac, Nap, HopX, HopY and/or urease antigen; 

Bordetella pertussis: such as pertussis holotoxin (PT) and filamentous haemagglutinin (FHA)
from B. pertussis, optionally also in combination with pertactin and/or agglutinogens 2 and 3 antigen;

Staphylococcus aureus: including S. aureus type 5 and 8 capsular polysaccharides optionally 
conjugated to nontoxic recombinant Pseudomonas aeruginosa exotoxin A, such as StaphVAX™, or 
antigens derived from surface proteins, invasins (leukocidin, kinases, hyaluronidase), surface factors 
that inhibit phagocytic engulfment (capsule, Protein A), carotenoids, catalase production, Protein A, 
coagulase, clotting factor, and/or membrane-damaging toxins (optionally detoxified) that lyse
eukaryotic cell membranes (hemolysins, leukotoxin, leukocidin); 

Staphylococcus epidermidis: particularly, S. epidermidis slime-associated antigen (SAA);

Staphylococcus saprophyticus: (causing urinary tract infections) particularly the 160 kDa
hemagglutinin of S. saprophyticus antigen;

Pseudomonas aeruginosa: particularly, endotoxin A, Wzz protein, P. aeruginosa LPS, more
particularly LPS isolated from PAO1 (O5 serotype), and/or Outer Membrane Proteins, including
Outer Membrane Proteins F (OprF) (Infect Immun. 2001 May; 69(5): 3510-3515);

Bacillus anthracis: such as B. anthracis antigens (optionally detoxified) from A-components
(lethal factor (LF) and edema factor (EF)), both of which can share a common B-component
known as protective antigen (PA);

Moraxella catarrhalis: (respiratory) including outer membrane protein antigens (HMW-OMP),
C-antigen, and/or LPS;

Yersinia pestis (plague): such as F1 capsular antigen (Infect Immun. 2003 Jan; 71(1): 374-383),
LPS (Infect Immun. 1999 Oct; 67(10): 5395), Yersinia pestis V antigen (Infect Immun. 1997
Nov; 65(11): 4476-4482);

Yersinia enterocolitica (gastrointestinal pathogen): particularly LPS (Infect Immun. 2002
August; 70(8): 4414);

Yersinia pseudotuberculosis: gastrointestinal pathogen antigens; 

Mycobacterium tuberculosis: such as lipoproteins, LPS, BCG antigens, a fusion protein of
antigen 85B (Ag85B) and/or ESAT-6 optionally formulated in cationic lipid vesicles (Infect Immun.
2004 October; 72(10): 6148), Mycobacterium tuberculosis (Mtb) isocitrate dehydrogenase associated
antigens (Proc Natl Acad Sci USA. 2004 Aug 24; 101(34): 12652), and/or MPT51 antigens
(Infect Immun. 2004 July; 72(7): 3829);

Legionella pneumophila (Legionnaires' Disease): L. pneumophila antigens, optionally
derived from cell lines with disrupted asd genes (Infect Immun. 1998 May; 66(5): 1898);

Rickettsia: including outer membrane proteins, including the outer membrane protein A
and/or B (OmpB) (Biochim Biophys Acta. 2004 Nov 1; 1702(2): 145), LPS, and surface protein antigen
(SPA) (J Autoimmun. 1989 Jun; 2 Suppl: 81);

E. coli: including antigens from enterotoxigenic E. coli (ETEC), enteroaggregative E. coli 
(EAggEC), diffusely adhering E. coli (DAEC), enteropathogenic E. coli (EPEC), and/or 
enterohemorrhagic E. coli (EHEC); 

Vibrio cholerae: including proteinase antigens, LPS, particularly lipopolysaccharides of
Vibrio cholerae II, O1 Inaba O-specific polysaccharides, V. cholerae O139, antigens of IEM108
vaccine (Infect Immun. 2003 Oct;71(10):5498-504), and/or Zonula occludens toxin (Zot);

Salmonella typhi (typhoid fever): including capsular polysaccharides preferably conjugates 
(Vi, i.e. vax-TyVi); 

Salmonella typhimurium (gastroenteritis): antigens derived therefrom are contemplated for
microbial and cancer therapies, including angiogenesis inhibition and modulation of flk;

Listeria monocytogenes (systemic infections in immunocompromised or elderly people,
infections of fetus): antigens derived from L. monocytogenes are preferably used as carriers/vectors
for intracytoplasmic delivery of conjugates/associated compositions of the present invention;

Porphyromonas gingivalis: particularly, P. gingivalis outer membrane protein (OMP);

Tetanus: such as tetanus toxoid (TT) antigens, preferably used as a carrier protein in
conjunction/conjugated with the compositions of the present invention;

modulating, inhibiting or associated with ADP ribosylation are contemplated for combination/co-
administration/conjugation with the compositions of the present invention, the diphtheria toxoids are 
preferably used as carrier proteins; 

Borrelia burgdorferi (Lyme disease): such as antigens associated with P39 and P13 (an
integral membrane protein, Infect Immun. 2001 May; 69(5): 3323-3334), VlsE Antigenic Variation
Protein (J Clin Microbiol. 1999 Dec; 37(12): 3997);

Haemophilus influenzae B: such as a saccharide antigen therefrom; 

Klebsiella: such as an OMP, including OMP A, or a polysaccharide optionally conjugated to
tetanus toxoid;

Neisseria gonorrhoeae: including a Por (or porin) protein, such as PorB (see Zhu et al.,
Vaccine (2004) 22:660-669), a transferrin binding protein, such as TbpA and TbpB (see Price et
al., Infection and Immunity (2004) 71(1):277-283), an opacity protein (such as Opa), a
reduction-modifiable protein (Rmp), and outer membrane vesicle (OMV) preparations (see Plante et al., J
Infectious Disease (2000) 182:848-855; also see, e.g., WO99/24578, WO99/36544, WO99/57280,
WO02/079243);

Chlamydia pneumoniae: particularly C. pneumoniae protein antigens;

Chlamydia trachomatis: including antigens derived from serotypes A, B, Ba and C
(agents of trachoma, a cause of blindness), serotypes L1, L2 & L3 (associated with Lymphogranuloma
venereum), and serotypes D-K;

Treponema pallidum (Syphilis): particularly a TmpA antigen; and 
Haemophilus ducreyi (causing chancroid): including outer membrane protein (DsrA). 

Where not specifically referenced, further bacterial antigens of the invention may be capsular
antigens, polysaccharide antigens or protein antigens of any of the above. Further bacterial antigens
may also include an outer membrane vesicle (OMV) preparation. Additionally, antigens include live,
attenuated, split, and/or purified versions of any of the aforementioned bacteria. The bacterial or
microbial derived antigens of the present invention may be gram-negative or gram-positive and
aerobic or anaerobic.

Additionally, any of the above bacterial-derived saccharides (polysaccharides, LPS, LOS or
oligosaccharides) can be conjugated to another agent or antigen, such as a carrier protein (for example
CRM197). Such conjugation may be direct conjugation effected by reductive amination of carbonyl
moieties on the saccharide to amino groups on the protein, as provided in US Patent No. 5,360,897
and Can J Biochem Cell Biol. 1984 May;62(5):270-5. Alternatively, the saccharides can be
conjugated through a linker, such as with succinamide or other linkages provided in Bioconjugate
Techniques, 1996 and CRC, Chemistry of Protein Conjugation and Cross-Linking, 1993.

Influenza: including whole viral particles (attenuated), split, or subunit comprising
hemagglutinin (HA) and/or neuraminidase (NA) surface proteins; the influenza antigens may be
derived from chicken embryos or propagated on cell culture, and/or the influenza antigens are derived
from influenza type A, B, and/or C, among others;

Respiratory syncytial virus (RSV): including the F protein of the A2 strain of RSV (J Gen
Virol. 2004 Nov; 85(Pt 11):3229) and/or G glycoprotein;

Parainfluenza virus (PIV): including PIV type 1, 2, and 3, preferably containing
hemagglutinin, neuraminidase and/or fusion glycoproteins;

Poliovirus: including antigens from a family of picornaviridae, preferably poliovirus antigens
such as OPV or, preferably, IPV;

Measles: including split measles virus (MV) antigen optionally combined with the Protollin
and/or antigens present in MMR vaccine;

Mumps: including antigens present in MMR vaccine;

Rubella: including antigens present in MMR vaccine as well as other antigens from
Togaviridae, including dengue virus;

Rabies: such as lyophilized inactivated virus (RabAvert™);

Flaviviridae viruses: such as (and antigens derived therefrom) yellow fever virus, Japanese
encephalitis virus, dengue virus (types 1, 2, 3, or 4), tick borne encephalitis virus, and West Nile
virus;

Caliciviridae: antigens therefrom;

HIV: including HIV-1 or HIV-2 strain antigens, such as gag (p24gag and p55gag), env
(gp160 and gp41), pol, tat, nef, rev, vpu, miniproteins (preferably p55 gag and gp140v delete), and
antigens from the isolates HIV IIIb, HIV SF2, HIV LAV, HIV LAI, HIV MN, HIV-1 CM235, HIV-W, HIV-2;
simian immunodeficiency virus (SIV) among others;

Rotavirus: including VP4, VP5, VP6, VP7, VP8 proteins (Protein Expr Purif. 2004
Dec;38(2):205) and/or NSP4;

Pestivirus: such as antigens from classical porcine fever virus, bovine viral diarrhoea virus,
and/or border disease virus;

Parvovirus: such as parvovirus B19;

Coronavirus: including SARS virus antigens, particularly spike protein or proteases 
therefrom, as well as antigens included in WO 04/92360; 

Hepatitis A virus: such as inactivated virus; 

Hepatitis B virus: such as the surface and/or core antigens (sAg), as well as the presurface
sequences, pre-S1 and pre-S2 (formerly called pre-S), as well as combinations of the above, such as
sAg/pre-S1, sAg/pre-S2, sAg/pre-S1/pre-S2, and pre-S1/pre-S2 (see, e.g., HBV Vaccines - Human
Vaccines and Vaccination, pp. 159-176; and U.S. Patent Nos. 4,722,840, 5,098,704, 5,324,513;
Birnbaum et al., J. Virol. (1990) 64:3319-3330; and
Zhou et al., J. Virol. (1991) 65:5457-5464);

Hepatitis C virus: such as E1, E2, E1/E2 (see Houghton et al., Hepatology (1991) 14:381),
NS345 polyprotein, NS345-core polyprotein, core, and/or peptides from the nonstructural regions
(International Publication Nos. WO 89/04669; WO 90/11089; and WO 90/14436);

Delta hepatitis virus (HDV): antigens derived therefrom, particularly δ-antigen from HDV
(see, e.g., U.S. Patent No. 5,378,814); 

Hepatitis E virus (HEV): antigens derived therefrom;

Hepatitis G virus (HGV): antigens derived therefrom;

Varicella zoster virus: antigens derived from varicella zoster virus (VZV) (J. Gen. Virol.
(1986) 67:1759);

Epstein-Barr virus: antigens derived from EBV (Baer et al., Nature (1984) 310:207); 

Cytomegalovirus: CMV antigens, including gB and gH (Cytomegaloviruses (J.K. McDougall,
ed., Springer-Verlag 1990) pp. 125-169);

Herpes simplex virus: including antigens from HSV-1 or HSV-2 strains and glycoproteins gB,
gD and gH (McGeoch et al., J. Gen. Virol. (1988) 69:1531 and U.S. Patent No. 5,171,568);

Human Herpes Virus: antigens derived from other human herpesviruses such as HHV6 and 
HHV7; and 

HPV: including antigens associated with or derived from human papillomavirus (HPV), for
example, one or more of E1-E7, L1, L2, and fusions thereof; particularly, the compositions of the
invention may include a virus-like particle (VLP) comprising the L1 major capsid protein; more
particularly still, the HPV antigens are protective against one or more of HPV serotypes 6, 11, 16 and/or
18.

Further provided are antigens, compositions, methods, and microbes included in Vaccines, 4th
Edition (Plotkin and Orenstein ed. 2004); Medical Microbiology, 4th Edition (Murray et al. ed. 2002);
Virology, 3rd Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition (B.N. Fields and
D.M. Knipe, eds. 1991), which are contemplated in conjunction with the compositions of the present
invention.

Additionally, antigens include live, attenuated, split, and/or purified versions of any of the
aforementioned viruses.
Fungal Antigens 

Fungal antigens for use herein, associated with vaccines, include those described in: U.S. Pat.
Nos. 4,229,434 and 4,368,191 for prophylaxis and treatment of trichophytosis caused by Trichophyton
mentagrophytes; U.S. Pat. Nos. 5,277,904 and 5,284,652 for a broad spectrum dermatophyte vaccine
for the prophylaxis of dermatophyte infection in animals, such as guinea pigs, cats, rabbits, horses and
lambs, where these antigens comprise a suspension of killed T. equinum, T. mentagrophytes (var.
granulare), M. canis and/or M. gypseum in an effective amount optionally combined with an adjuvant;
a ringworm vaccine comprising an effective amount of a
homogenized, formaldehyde-killed fungi, i.e., Microsporum canis culture in a carrier; U.S. Pat. No.
5,948,413 involving extracellular and intracellular proteins for pythiosis. Additional antigens
identified within antifungal vaccines include Ringvac bovis LTF-130 and Bioveta.

Further, fungal antigens for use herein may be derived from Dermatophytes, including:
Epidermophyton floccosum, Microsporum audouini, Microsporum canis, Microsporum distortum,
Microsporum equinum, Microsporum gypseum, Microsporum nanum, Trichophyton concentricum,
Trichophyton equinum, Trichophyton gallinae, Trichophyton gypseum, Trichophyton megnini,
Trichophyton mentagrophytes, Trichophyton quinckeanum, Trichophyton rubrum, Trichophyton
schoenleinii, Trichophyton tonsurans, Trichophyton verrucosum, T. verrucosum var. album, var.
discoides, var. ochraceum, Trichophyton violaceum, and/or Trichophyton faviforme.

Fungal pathogens for use as antigens or in derivation of antigens in conjunction with the
compositions of the present invention comprise Aspergillus fumigatus, Aspergillus flavus, Aspergillus
niger, Aspergillus nidulans, Aspergillus terreus, Aspergillus sydowi, Aspergillus flavatus, Aspergillus
glaucus, Blastoschizomyces capitatus, Candida albicans, Candida enolase, Candida tropicalis,
Candida glabrata, Candida krusei, Candida parapsilosis, Candida stellatoidea, Candida kusei,
Candida parakrusei, Candida lusitaniae, Candida pseudotropicalis, Candida guilliermondi,
Cladosporium carrionii, Coccidioides immitis, Blastomyces dermatitidis, Cryptococcus neoformans,
Geotrichum clavatum, Histoplasma capsulatum, Klebsiella pneumoniae, Paracoccidioides
brasiliensis, Pneumocystis carinii, Pythium insidiosum, Pityrosporum ovale, Saccharomyces
cerevisiae, Saccharomyces boulardii, Saccharomyces pombe, Scedosporium apiospermum, Sporothrix
schenckii, Trichosporon beigelii, Toxoplasma gondii, Penicillium marneffei, Malassezia spp.,
Fonsecaea spp., Wangiella spp., Sporothrix spp., Basidiobolus spp., Conidiobolus spp., Rhizopus spp.,
Mucor spp., Absidia spp., Mortierella spp., Cunninghamella spp., and Saksenaea spp.

Other fungi from which antigens are derived include Alternaria spp., Curvularia spp.,
Helminthosporium spp., Fusarium spp., Aspergillus spp., Penicillium spp., Monolinia spp., Rhizoctonia
spp., Paecilomyces spp., Pithomyces spp., and Cladosporium spp.

Processes for producing fungal antigens are well known in the art (see US Patent No.
6,333,164). In a preferred method, a solubilized fraction is extracted and separated from an insoluble
fraction obtainable from fungal cells from which the cell wall has been substantially removed or at least
partially removed, the process comprising the steps of: obtaining living fungal
cells; obtaining fungal cells from which the cell wall has been substantially removed or at least partially
removed; bursting the fungal cells from which the cell wall has been substantially removed or at least
partially removed; obtaining an insoluble fraction; and extracting and separating a solubilized fraction
from the insoluble fraction.

In particular embodiments, microbes (bacteria, viruses and/or fungi) against which the present
compositions and methods can be implemented include those that cause sexually transmitted diseases
(STDs) and/or those that display on their surface an antigen that can be the target or antigen 
composition of the invention. In a preferred embodiment of the invention, compositions are combined 
with antigens derived from a viral or bacterial STD. Antigens derived from bacteria or viruses can be 
administered in conjunction with the compositions of the present invention to provide protection 
against at least one of the following STDs, among others: chlamydia, genital herpes, hepatitis 
(particularly HCV), genital warts, gonorrhoea, syphilis and/or chancroid (See, WO00/15255). 

In another embodiment the compositions of the present invention are co-administered with an 
antigen for the prevention or treatment of an STD. 



Antigens derived from the following viruses associated with STDs, which are described in 
greater detail above, are preferred for co-administration with the compositions of the present 
invention: hepatitis (particularly HCV), HPV, HIV, or HSV.

Additionally, antigens derived from the following bacteria associated with STDs, which are 
described in greater detail above, are preferred for co-administration with the compositions of the 
present invention: Neisseria gonorrhoeae, Chlamydia pneumoniae, Chlamydia trachomatis,
Treponema pallidum, or Haemophilus ducreyi. 
Respiratory Antigens 

The antigen may be a respiratory antigen and could further be used in an immunogenic
composition for methods of preventing and/or treating infection by a respiratory pathogen, including a
virus, bacterium, or fungus such as respiratory syncytial virus (RSV), PIV, SARS virus, influenza, or
Bacillus anthracis, particularly by reducing or preventing infection and/or one or more symptoms of
respiratory virus infection. A composition comprising an antigen described herein, such as one
derived from a respiratory virus, bacterium or fungus, is administered in conjunction with the
compositions of the present invention to an individual who is at risk of being exposed to that
particular respiratory microbe, has been exposed to a respiratory microbe or is infected with a
respiratory virus, bacterium or fungus. The composition(s) of the present invention is/are preferably
co-administered at the same time or in the same formulation with an antigen of the respiratory pathogen.
Administration of the composition results in reduced incidence and/or severity of one or more 
symptoms of respiratory infection. 
Pediatric/Geriatric Antigens 

In one embodiment the compositions of the present invention are used in conjunction with an 
antigen for treatment of a pediatric population, as in a pediatric antigen. In a more particular 
embodiment the pediatric population is less than about 3 years old, or less than about 2 years, or less 
than about 1 year old. In another embodiment the pediatric antigen (in conjunction with the
composition of the present invention) is administered multiple times over at least 1, 2, or 3 years.

In another embodiment the compositions of the present invention are used in conjunction with
an antigen for treatment of a geriatric population, as in a geriatric antigen.
Other Antigens 

Other antigens for use in conjunction with the compositions of the present invention include
hospital-acquired (nosocomial) associated antigens.

In another embodiment, parasitic antigens are contemplated in conjunction with the 
compositions of the present invention. Examples of parasitic antigens include those derived from 
organisms causing malaria and/or Lyme disease. 

In another embodiment, the antigens in conjunction with the compositions of the present
invention are associated with or effective against a mosquito-borne illness. In another embodiment, the
antigens in conjunction with the compositions of the present invention are associated with or effective 
against encephalitis. In another embodiment the antigens in conjunction with the compositions of the 
present invention are associated with or effective against an infection of the nervous system. 

In another embodiment, the antigens in conjunction with the compositions of the present
invention are antigens transmissible through blood or body fluids.
Antigen Formulations 

In other aspects of the invention, methods of producing microparticles
having adsorbed antigens are provided. The methods comprise: (a) providing an emulsion by
dispersing a mixture comprising (i) water, (ii) a detergent, (iii) an organic solvent, and (iv) a
biodegradable polymer selected from the group consisting of a poly(α-hydroxy acid), a polyhydroxy
butyric acid, a polycaprolactone, a polyorthoester, a polyanhydride, and a polycyanoacrylate, wherein the
polymer is typically present in the mixture at a concentration of about 1% to about 30% relative to the
organic solvent, while the detergent is typically present in the mixture at a weight-to-weight
detergent-to-polymer ratio of from about 0.00001:1 to about 0.1:1 (more typically about 0.0001:1 to
about 0.1:1, about 0.001:1 to about 0.1:1, or about 0.005:1 to about 0.1:1); (b) removing the organic
solvent from the emulsion; and (c) adsorbing an antigen on the surface of the microparticles. In
certain embodiments, the biodegradable polymer is present at a concentration of about 3% to about
10% relative to the organic solvent.
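
By way of illustration only, the quantities implied by the above ranges can be worked through in a short sketch. The function name and arguments are hypothetical, and the sketch assumes that the polymer percentage is weight per volume (grams per 100 mL of organic solvent), which the text does not state; the "about" qualifiers are treated here as hard limits purely for simplicity.

```python
def formulation_masses(solvent_volume_ml, polymer_pct, detergent_to_polymer):
    """Return (polymer_g, detergent_g) for a microparticle emulsion.

    Assumption (not stated in the text): polymer_pct is weight/volume,
    i.e. grams of polymer per 100 mL of organic solvent.
    """
    # Polymer: about 1% to about 30% relative to the organic solvent.
    if not 1.0 <= polymer_pct <= 30.0:
        raise ValueError("polymer concentration outside the 1%-30% range")
    # Detergent: weight-to-weight detergent-to-polymer ratio of
    # about 0.00001:1 to about 0.1:1.
    if not 1e-5 <= detergent_to_polymer <= 0.1:
        raise ValueError("detergent-to-polymer ratio outside 0.00001:1-0.1:1")

    polymer_g = solvent_volume_ml * polymer_pct / 100.0
    detergent_g = polymer_g * detergent_to_polymer
    return polymer_g, detergent_g


# Example: 10 mL of solvent, 6% polymer, detergent-to-polymer ratio 0.005:1.
polymer_g, detergent_g = formulation_masses(10.0, 6.0, 0.005)
print(f"polymer: {polymer_g:.3f} g, detergent: {detergent_g:.4f} g")
```

In this example the 6% polymer concentration falls within the narrower 3% to 10% range of certain embodiments, and the ratio of 0.005:1 sits at the lower end of the narrowest detergent range recited above.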

Microparticles for use herein will be formed from materials that are
sterilizable, non-toxic and biodegradable. Such materials include, without limitation, poly(α-hydroxy
acid), polyhydroxybutyric acid, polycaprolactone, polyorthoester, polyanhydride, PACA, and
polycyanoacrylate. Preferably, microparticles for use with the present invention are derived from a
poly(α-hydroxy acid), in particular, from a poly(lactide) ("PLA") or a copolymer of D,L-lactide and
glycolide or glycolic acid, such as a poly(D,L-lactide-co-glycolide) ("PLG" or "PLGA"), or a
copolymer of D,L-lactide and caprolactone. The microparticles may be derived from any of various
polymeric starting materials which have a variety of molecular weights and, in the case of the
copolymers such as PLG, a variety of lactide:glycolide ratios, the selection of which will be largely a
matter of choice, as discussed more fully below.



Further antigens may also include an outer membrane vesicle (OMV) preparation. 

Additional formulation methods and antigens (especially tumor antigens) are provided in U.S. 
Patent Serial No. 09/581,772. 
Antigen References 

The following references include antigens useful in conjunction with the compositions of
the present invention:



1 International patent application WO99/24578.
2 International patent application WO99/36544.
3 International patent application WO99/57280.
4 International patent application WO00/22430.
5 Tettelin et al. (2000) Science 287:1809-1815.
6 International patent application WO96/29412.
7 Pizza et al. (2000) Science 287:1816-1820.
8 PCT WO 01/52885.
9 Bjune et al. (1991) Lancet 338(8775).
10 Fuskasawa et al. (1999) Vaccine 17:2951-2958.
11 Rosenqist et al. (1998) Dev. Biol. Stand. 92:323-333.
12 Constantino et al. (1992) Vaccine 10:691-698.
13 Constantino et al. (1999) Vaccine 17:1251-1263.
14 Watson (2000) Pediatr Infect Dis J 19:331-332.
15 Rubin (2000) Pediatr Clin North Am 47:269-285, v.
16 Jedrzejas (2001) Microbiol Mol Biol Rev 65:187-207.
17 International patent application filed on 3rd July 2001 claiming priority from GB-0016363.4; WO 02/02606; PCT IB/01/00166.
18 Kalman et al. (1999) Nature Genetics 21:385-389.
19 Read et al. (2000) Nucleic Acids Res 28:1397-406.
20 Shirai et al. (2000) J. Infect. Dis 181(Suppl 3):S524-S527.
21 International patent application WO99/27105.
22 International patent application WO00/27994.
23 International patent application WO00/37494.
24 International patent application WO99/28475.
25 Bell (2000) Pediatr Infect Dis J 19:1187-1188.
26 Iwarson (1995) APMIS 103:321-326.
27 Gerlich et al. (1990) Vaccine 8 Suppl:S63-68 & 79-80.
28 Hsu et al. (1999) Clin Liver Dis 3:901-915.
29 Gustafsson et al. (1996) N. Engl. J. Med. 334:349-355.
30 Rappuoli et al. (1991) TIBTECH 9:232-238.
31 Vaccines (1988) eds. Plotkin & Mortimer. ISBN 0-7216-1946-0.
32 Del Guidice et al. (1998) Molecular Aspects of Medicine 19:1-70.
33 International patent application WO93/018150.
34 International patent application WO99/53310.
35 International patent application WO98/04702.
36 Ross et al. (2001) Vaccine 19:135-142.
37 Sutter et al. (2000) Pediatr Clin North Am 47:287-308.
38 Zimmerman & Spann (1999) Am Fam Physician 59:113-118, 125-126.
39 Dreensen (1997) Vaccine 15 Suppl:S2-6.
40 MMWR Morb Mortal Wkly Rep 1998 Jan 16:47(1): 12, 9.
41 McMichael (2000) Vaccine 19 Suppl 1:S101-107.
42 [...]:51-6.
43 [...] 0028727.6 & 0105640.7.
44 Dale (1999) Infect Dis Clin North Am 13:227-43, viii.
45 Ferretti et al. (2001) PNAS USA 98:4658-4663.
46 Kuroda et al. (2001) Lancet 357(9264):1225-1240; see also pages 1218-1219.
47 Ramsay et al. (2001) Lancet 357(9251):195-196.
48 Lindberg (1999) Vaccine 17 Suppl. 2:S28-36.
49 Buttery & Moxon (2000) J R Coll Physicians Lond 34:163-168.
50 Ahmad & Chapnick (1999) Infect Dis Clin North Am 13:113-133, vii.
51 Goldblatt (1998) J. Med. Microbiol. 47:563-567.
52 European patent 0 477 508.
53 U.S. Patent No. 5,306,492.
54 International patent application WO98/42721.
55 Conjugate Vaccines (eds. Cruse et al.) ISBN 3805549326, particularly vol. 10:48-114.
56 Hermanson (1996) Bioconjugate Techniques ISBN: 012323368 & 012342335X.
57 European patent application 0372501.
58 European patent application 0378881.
59 European patent application 0427347.
60 International patent application WO93/17712.
61 International patent application WO98/58668.
62 European patent application 0471177.
63 International patent application WO00/56360.
64 International patent application WO00/67161.

The contents of all of the above cited patents, patent applications and journal articles are
incorporated by reference as if set forth fully herein.

There may be an upper limit to the number of Gram positive bacterial proteins which will be
in the compositions of the invention. Preferably, the number of Gram positive bacterial proteins in a
composition of the invention is less than 20, less than 19, less than 18, less than 17, less than 16, less
than 15, less than 14, less than 13, less than 12, less than 11, less than 10, less than 9, less than 8, less
than 7, less than 6, less than 5, less than 4, or less than 3. Still more preferably, the number of Gram
positive bacterial proteins in a composition of the invention is less than 6, less than 5, or less than 4.
Still more preferably, the number of Gram positive bacterial proteins in a composition of the
invention is 3.

The Gram positive bacterial proteins and polynucleotides used in the invention are preferably
isolated, i.e., separate and discrete, from the whole organism with which the molecule is found in
nature or, when the polynucleotide or polypeptide is not found in nature, is sufficiently free of other
biological macromolecules so that the polynucleotide or polypeptide can be used for its intended
purpose.

Fusion Proteins: GBS AI sequences

The GBS AI proteins used in the invention may be present in the composition as individual
separate polypeptides, but it is preferred that at least two (i.e. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, or 18) of the antigens are expressed as a single polypeptide chain (a "hybrid" or "fusion"
polypeptide). Such fusion polypeptides offer two principal advantages: first, a polypeptide that may
be unstable or poorly expressed on its own can be assisted by adding a suitable fusion partner that
overcomes the problem; second, commercial manufacture is simplified as only one expression and
purification need be employed in order to produce two polypeptides which are both antigenically
useful.

The fusion polypeptide may comprise one or more AI polypeptide sequences. Preferably, the
fusion comprises an AI surface protein sequence. Preferably, the fusion polypeptide includes one or
more of GBS 80, GBS 104, and GBS 67. Most preferably, the fusion peptide includes a polypeptide
sequence from GBS 80. Accordingly, the invention includes a fusion peptide comprising a first
amino acid sequence and a second amino acid sequence, wherein said first and second amino acid
sequences are selected from a GBS AI surface protein or a fragment thereof. Preferably, the first and
second amino acid sequences in the fusion polypeptide comprise different epitopes.

Hybrids (or fusions) consisting of amino acid sequences from two, three, four, five, six,
seven, eight, nine, or ten GBS antigens are preferred. In particular, hybrids consisting of amino acid
sequences from two, three, four, or five GBS antigens are preferred.

Different hybrid polypeptides may be mixed together in a single formulation. Within such
combinations, a GBS antigen may be present in more than one hybrid polypeptide and/or as a
non-hybrid polypeptide. It is preferred, however, that an antigen is present either as a hybrid or as a
non-hybrid, but not as both.

Hybrid polypeptides can be represented by the formula NH2-A-{-X-L-}n-B-COOH, wherein:
X is an amino acid sequence of a GBS AI protein or a fragment thereof; L is an optional linker amino
acid sequence; A is an optional N-terminal amino acid sequence; B is an optional C-terminal amino
acid sequence; and n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15.

If a -X- moiety has a leader peptide sequence in its wild-type form, this may be included or
omitted in the hybrid protein. In some embodiments, the leader peptides will be deleted except for
that of the -X- moiety located at the N-terminus of the hybrid protein, i.e. the leader peptide of X1 will
be retained, but the leader peptides of X2 ... Xn will be omitted. This is equivalent to deleting all
leader peptides and using the leader peptide of X1 as moiety -A-.

For each of the n instances of {-X-L-}, linker amino acid sequence -L- may be present or absent.
For instance, when n=2 the hybrid may be NH2-X1-L1-X2-L2-COOH, NH2-X1-X2-COOH,
NH2-X1-L1-X2-COOH, NH2-X1-X2-L2-COOH, etc. Linker amino acid sequence(s) -L- will typically
be short (e.g. 20 or fewer amino acids i.e. 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1).
Examples comprise short peptide sequences which facilitate cloning, poly-glycine linkers (i.e.
comprising (Gly)n where n = 2, 3, 4, 5, 6, 7, 8, 9, 10 or more), and histidine tags (i.e. (His)n where n = 3,
4, 5, 6, 7, 8, 9, 10 or more). Other suitable linker amino acid sequences will be apparent to those
skilled in the art. A useful linker is GSGGGG, with the Gly-Ser dipeptide being formed from a
BamHI restriction site, thus aiding cloning and manipulation, and the (Gly)4 tetrapeptide being a
typical poly-glycine linker.
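
As a purely illustrative sketch, the formula NH2-A-{-X-L-}n-B-COOH can be read as a simple concatenation of one-letter amino acid strings. The function name and the short placeholder sequences below are hypothetical and are not GBS sequences; only the GSGGGG linker and the histidine tag are taken from the description above.

```python
def build_hybrid(x_seqs, linkers=None, a="", b=""):
    """Assemble NH2-A-{-X-L-}n-B-COOH as a one-letter amino acid string.

    x_seqs  : the n -X- moieties, N-terminus first (n is 2 to 15)
    linkers : one -L- per -X-; an empty string means the linker is absent
    a, b    : optional N-terminal (-A-) and C-terminal (-B-) sequences
    """
    n = len(x_seqs)
    if not 2 <= n <= 15:
        raise ValueError("n must be between 2 and 15")
    if linkers is None:
        linkers = [""] * n  # all linkers absent
    if len(linkers) != n:
        raise ValueError("one linker slot is required per -X- moiety")

    body = "".join(x + l for x, l in zip(x_seqs, linkers))
    return a + body + b


# n = 2 with L1 = GSGGGG present and L2 absent: NH2-X1-L1-X2-COOH.
# "KTAV" and "AVLE" are made-up placeholders for X1 and X2.
hybrid = build_hybrid(["KTAV", "AVLE"], ["GSGGGG", ""], a="M", b="HHHHHH")
print(hybrid)
```

Here -A- supplies the N-terminus methionine and -B- is a (His)6 tag; representing an absent linker as an empty string keeps one slot per {-X-L-} unit, mirroring the formula.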

-A- is an optional N-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include leader sequences to direct
protein trafficking, or short peptide sequences which facilitate cloning or purification (e.g. histidine
tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more). Other suitable N-terminal amino acid
sequences will be apparent to those skilled in the art. If X1 lacks its own N-terminus methionine, -A-
is preferably an oligopeptide (e.g. with 1, 2, 3, 4, 5, 6, 7 or 8 amino acids) which provides an
N-terminus methionine.

-B- is an optional C-terminal amino acid sequence. This will typically be short (e.g. 40 or 
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include sequences to direct 
protein trafficking, short peptide sequences which facilitate cloning or purification (e.g. comprising
histidine tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more), or sequences which enhance protein
stability. Other suitable C-terminal amino acid sequences will be apparent to those skilled in the art. 

Most preferably, n is 2 or 3. 
Fusion Proteins: Gram positive bacteria AI sequences 

The Gram positive bacteria AI proteins used in the invention may be present in the 
composition as individual separate polypeptides, but it is preferred that at least two (i.e. 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18) of the antigens are expressed as a single polypeptide chain 
(a "hybrid" or "fusion" polypeptide). Such fusion polypeptides offer two principal advantages: first, a 
polypeptide that may be unstable or poorly expressed on its own can be assisted by adding a suitable 
fusion partner that overcomes the problem; second, commercial manufacture is simplified as only one 
expression and purification need be employed in order to produce two polypeptides which are both 
antigenically useful. 

The fusion polypeptide may comprise one or more AI polypeptide sequences. Preferably, the 
fusion comprises an AI surface protein sequence. Accordingly, the invention includes a fusion 
peptide comprising a first amino acid sequence and a second amino acid sequence, wherein said first 
and second amino acid sequences are selected from a Gram positive bacteria AI protein or a fragment 
thereof. Preferably, the first and second amino acid sequences in the fusion polypeptide comprise 
different epitopes. 

Hybrids (or fusions) consisting of amino acid sequences from two, three, four, five, six, 
seven, eight, nine, or ten Gram positive bacteria antigens are preferred. In particular, hybrids 
consisting of amino acid sequences from two, three, four, or five Gram positive bacteria antigens are 
preferred. 

Different hybrid polypeptides may be mixed together in a single formulation. Within such 
combinations, a Gram positive bacteria AI sequence may be present in more than one hybrid 
polypeptide and/or as a non-hybrid polypeptide. It is preferred, however, that an antigen is present 
either as a hybrid or as a non-hybrid, but not as both. 

Hybrid polypeptides can be represented by the formula NH2-A-{-X-L-}n-B-COOH, wherein:
X is an amino acid sequence of a Gram positive bacteria AI sequence or a fragment thereof; L is an
optional linker amino acid sequence; A is an optional N-terminal amino acid sequence; B is an
optional C-terminal amino acid sequence; and n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15.

If a -X- moiety has a leader peptide sequence in its wild-type form, this may be included or
omitted in the hybrid protein. In some embodiments, the leader peptides will be deleted except for
that of the -X- moiety located at the N-terminus of the hybrid protein, i.e. the leader peptide of X1 will
be retained, but the leader peptides of X2 ... Xn will be omitted. This is equivalent to deleting all
leader peptides and using the leader peptide of X1 as moiety -A-.

For each of the n instances of {-X-L-}, linker amino acid sequence -L- may be present or absent.
For instance, when n=2 the hybrid may be NH2-X1-L1-X2-L2-COOH, NH2-X1-X2-COOH,
NH2-X1-L1-X2-COOH, NH2-X1-X2-L2-COOH, etc. Linker amino acid sequence(s) -L- will typically
be short (e.g. 20 or fewer amino acids i.e. 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1).
Examples comprise short peptide sequences which facilitate cloning, poly-glycine linkers (i.e.
comprising (Gly)n where n = 2, 3, 4, 5, 6, 7, 8, 9, 10 or more), and histidine tags (i.e. (His)n where n = 3,
4, 5, 6, 7, 8, 9, 10 or more). Other suitable linker amino acid sequences will be apparent to those
skilled in the art. A useful linker is GSGGGG, with the Gly-Ser dipeptide being formed from a
BamHI restriction site, thus aiding cloning and manipulation, and the (Gly)4 tetrapeptide being a
typical poly-glycine linker.
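
Because each linker position may independently be present or absent, a hybrid with n moieties admits 2^n linker arrangements. The following sketch, in which the function name is hypothetical and offered only for illustration, enumerates them in the notation used above.

```python
from itertools import product


def linker_patterns(n):
    """List every present/absent linker arrangement for n -X- moieties."""
    patterns = []
    # Each of the n linker positions is independently present or absent.
    for present in product([True, False], repeat=n):
        parts = ["NH2"]
        for i, has_linker in enumerate(present, start=1):
            parts.append(f"X{i}")
            if has_linker:
                parts.append(f"L{i}")
        parts.append("COOH")
        patterns.append("-".join(parts))
    return patterns


for pattern in linker_patterns(2):
    print(pattern)
```

For n=2 this yields the four arrangements recited in the text; for n=3 it yields eight.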

-A- is an optional N-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include leader sequences to direct
protein trafficking, or short peptide sequences which facilitate cloning or purification (e.g. histidine
tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more). Other suitable N-terminal amino acid
sequences will be apparent to those skilled in the art. If X1 lacks its own N-terminus methionine, -A-
is preferably an oligopeptide (e.g. with 1, 2, 3, 4, 5, 6, 7 or 8 amino acids) which provides an
N-terminus methionine.

-B- is an optional C-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include sequences to direct
protein trafficking, short peptide sequences which facilitate cloning or purification (e.g. comprising
histidine tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more), or sequences which enhance protein
stability. Other suitable C-terminal amino acid sequences will be apparent to those skilled in the art.

Most preferably, n is 2 or 3.

Antibodies: GBS AI sequences 

The GBS AI proteins of the invention may also be used to prepare antibodies specific to the
GBS AI proteins. The antibodies are preferably specific to an oligomeric or hyper-oligomeric
form of an AI protein. The invention also includes combinations of antibodies specific to GBS AI
proteins selected to provide protection against an increased range of GBS serotypes and strain
isolates. For example, a combination may comprise a first and second antibody, wherein said first
antibody is specific to a first GBS AI protein and said second antibody is specific to a second GBS AI
protein. Preferably, the nucleic acid sequence encoding said first GBS AI protein is not present in a
GBS genome comprising a polynucleotide sequence encoding for said second GBS AI protein.
Preferably, the nucleic acid sequences encoding said first and second GBS AI proteins are present in
the genomes of multiple GBS serotypes and strain isolates.

The GBS specific antibodies of the invention include one or more biological moieties that,
through chemical or physical means, can bind to or associate with an epitope of a GBS polypeptide.
The antibodies of the invention include antibodies which specifically bind to a GBS AI protein. The
invention includes antibodies obtained from both polyclonal and monoclonal preparations, as well as
the following: hybrid (chimeric) antibody molecules (see, for example, Winter et al. (1991) Nature
349:293-299; and US Patent No. 4,816,567); F(ab')2 and F(ab) fragments; Fv molecules (non-covalent
heterodimers, see, for example, Inbar et al. (1972) Proc Natl Acad Sci USA 69:2659-2662; and
Ehrlich et al. (1980) Biochem 19:4091-4096); single-chain Fv molecules (sFv) (see, for example,
Huston et al. (1988) Proc Natl Acad Sci USA 85:5879-5883); dimeric and trimeric antibody fragment
constructs; minibodies (see, e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J
Immunology 149B:120-126); humanized antibody molecules (see, for example, Riechmann et al.
(1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 239:1534-1536; and U.K. Patent
Publication No. GB 2,276,169, published 21 September 1994); and any functional fragments
obtained from such molecules, wherein such fragments retain immunological binding properties of the
parent antibody molecule. The invention further includes antibodies obtained through
non-conventional processes, such as phage display.

Preferably, the GBS specific antibodies of the invention are monoclonal antibodies.
Monoclonal antibodies of the invention include an antibody composition having a homogeneous
antibody population. Monoclonal antibodies of the invention may be obtained from murine
hybridomas, as well as human monoclonal antibodies obtained using human rather than murine
hybridomas. See, e.g., Cote, et al. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, p
77.

The antibodies of the invention may be used in diagnostic applications, for example, to detect
the presence or absence of GBS in a biological sample. The antibodies of the invention may also be
used in the prophylactic or therapeutic treatment of GBS infection.
Antibodies: Gram positive bacteria AI sequences 

The Gram positive bacteria AI proteins of the invention may also be used to prepare
antibodies specific to the Gram positive bacteria AI proteins. The antibodies are preferably specific to
an oligomeric or hyper-oligomeric form of an AI protein. The invention also includes
combinations of antibodies specific to Gram positive bacteria AI proteins selected to provide
protection against an increased range of Gram positive bacteria genus, species, serotypes and strain
isolates.

For example, a combination may comprise a first and second antibody, wherein said first
antibody is specific to a first Gram positive bacteria AI protein and said second antibody is specific to
a second Gram positive bacteria AI protein. Preferably, the nucleic acid sequence encoding said first
Gram positive bacteria AI protein is not present in a Gram positive bacterial genome comprising a
polynucleotide sequence encoding for said second Gram positive bacteria AI protein. Preferably, the
nucleic acid sequences encoding said first and second Gram positive bacteria AI proteins are present in
the genomes of multiple Gram positive bacteria genus, species, serotypes or strain isolates.

As an example of an instance where the combination of antibodies provides protection against 
an increased range of bacteria serotypes, the first antibody may be specific to a first GAS AI protein 
and the second antibody may be specific to a second GAS AI protein. The first GAS AI protein may 
comprise a GAS AI-1 surface protein, while the second GAS AI protein may comprise a GAS AI-2 or 
AI-3 surface protein. 

As an example of an instance where the combination of antibodies provides protection against 
an increased range of bacterial species, the first antibody may be specific to a GBS AI protein and the 
second antibody may be specific to a GAS AI protein. Alternatively, the first antibody may be 
specific to a GAS AI protein and the second antibody may be specific to a S. pneumoniae AI protein. 

The Gram positive specific antibodies of the invention include one or more biological
moieties that, through chemical or physical means, can bind to or associate with an epitope of a Gram
positive bacteria AI polypeptide. The antibodies of the invention include antibodies which
specifically bind to a Gram positive bacteria AI protein. The invention includes antibodies obtained
from both polyclonal and monoclonal preparations, as well as the following: hybrid (chimeric)
antibody molecules (see, for example, Winter et al. (1991) Nature 349:293-299; and US Patent No.
4,816,567); F(ab')2 and F(ab) fragments; Fv molecules (non-covalent heterodimers, see, for example,
Inbar et al. (1972) Proc Natl Acad Sci USA 69:2659-2662; and Ehrlich et al. (1980) Biochem
19:4091-4096); single-chain Fv molecules (sFv) (see, for example, Huston et al. (1988) Proc Natl
Acad Sci USA 85:5879-5883); dimeric and trimeric antibody fragment constructs; minibodies (see,
e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J Immunology 149B:120-126);
humanized antibody molecules (see, for example, Riechmann et al. (1988) Nature 332:323-327;
Verhoeyen et al. (1988) Science 239:1534-1536; and U.K. Patent Publication No. GB 2,276,169,
published 21 September 1994); and any functional fragments obtained from such molecules, wherein
such fragments retain immunological binding properties of the parent antibody molecule. The
invention further includes antibodies obtained through non-conventional processes, such as phage
display.

Preferably, the Gram positive specific antibodies of the invention are monoclonal antibodies.
Monoclonal antibodies of the invention include an antibody composition having a homogeneous
antibody population. Monoclonal antibodies of the invention may be obtained from murine
hybridomas, as well as human monoclonal antibodies obtained using human rather than murine
hybridomas. See, e.g., Cote, et al. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, p
77.

The antibodies of the invention may be used in diagnostic applications, for example, to detect 
the presence or absence of Gram positive bacteria in a biological sample. The antibodies of the 
invention may also be used in the prophylactic or therapeutic treatment of Gram positive bacteria 
infection. 
Nucleic Acids 

The invention provides nucleic acids encoding the Gram positive bacteria sequences and/or
the hybrid fusion polypeptides of the invention. The invention also provides nucleic acid encoding
the GBS antigens and/or the hybrid fusion polypeptides of the invention. Furthermore, the invention
provides nucleic acid which can hybridise to these nucleic acids, preferably under "high stringency"
conditions (e.g. 65°C in a 0.1xSSC, 0.5% SDS solution).

Polypeptides of the invention can be prepared by various means (e.g. recombinant expression, 
purification from cell culture, chemical synthesis, etc.) and in various forms (e.g. native, fusions, 
non-glycosylated, lipidated, etc.). They are preferably prepared in substantially pure form (i.e. 
substantially free from other GAS or host cell proteins). 

Nucleic acid according to the invention can be prepared in many ways (e.g. by chemical 
synthesis, from genomic or cDNA libraries, from the organism itself, etc.) and can take various forms 
(e.g. single stranded, double stranded, vectors, probes, etc.). They are preferably prepared in 
substantially pure form (i.e. substantially free from other GBS or host cell nucleic acids). 

The term "nucleic acid" includes DNA and RNA, and also their analogues, such as those 
containing modified backbones (e.g. phosphorothioates, etc.), and also peptide nucleic acids (PNA), 
etc. The invention includes nucleic acid comprising sequences complementary to those described 
above (e.g. for antisense or probing purposes). 
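
As an illustration of what a sequence "complementary to those described above" looks like in
practice, the following minimal Python sketch computes the reverse complement of a DNA sequence.
The function name and the example sequence are hypothetical and are not taken from the sequence
listing; the sketch assumes plain DNA and handles only the unambiguous bases A, C, G and T.

```python
# Illustrative sketch only: the function name and the example sequence are
# assumptions, not part of the disclosed sequences.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    allowed = set("ACGTacgt")
    if not set(dna) <= allowed:
        raise ValueError("sequence contains characters other than A, C, G, T")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGTCACGT"))
```

A probe or antisense strand designed this way pairs base by base with the original strand;
modified backbones or RNA targets would need a different alphabet than the one assumed here.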

The invention also provides a process for producing a polypeptide of the invention, 
comprising the step of culturing a host cell transformed with nucleic acid of the invention under 
conditions which induce polypeptide expression. 

The invention provides a process for producing a polypeptide of the invention, comprising the 
step of synthesising at least part of the polypeptide by chemical means. 

The invention provides a process for producing nucleic acid of the invention, comprising the
step of amplifying nucleic acid using a primer-based amplification method (e.g. PCR).

The invention provides a process for producing nucleic acid of the invention, comprising the 
step of synthesising at least part of the nucleic acid by chemical means.

Purification and Recombinant Expression

The Gram positive bacteria AI proteins of the invention may be isolated from the native Gram 
positive bacteria, or they may be recombinantly produced, for instance in a heterologous host. For 
example, the GAS, GBS, and S. pneumoniae antigens of the invention may be isolated from
[...] instance, in a heterologous host. Preferably, the GBS antigens are prepared using a heterologous host.

The heterologous host may be prokaryotic (e.g. a bacterium) or eukaryotic. It is preferably
E. coli, but other suitable hosts include Bacillus subtilis, Vibrio cholerae, Salmonella typhi, Salmonella
typhimurium, Neisseria lactamica, Neisseria cinerea, Mycobacteria (e.g. M. tuberculosis), S. gordonii,
L. lactis, yeasts, etc.

Recombinant production of polypeptides is facilitated by adding a tag protein to the Gram
positive bacteria AI sequence to be expressed as a fusion protein comprising the tag protein and the
Gram positive bacteria antigen. For example, recombinant production of polypeptides is facilitated by
adding a tag protein to the GBS antigen to be expressed as a fusion protein comprising the tag protein
and the GBS antigen. Such tag proteins can facilitate purification, detection and stability of the
expressed protein. Tag proteins suitable for use in the invention include a polyarginine tag (Arg-tag),
polyhistidine tag (His-tag), FLAG-tag, Strep-tag, c-myc-tag, S-tag, calmodulin-binding peptide,
cellulose-binding domain, SBP-tag, chitin-binding domain, glutathione S-transferase-tag (GST),
maltose-binding protein, transcription termination anti-termination factor (NusA), E. coli
thioredoxin (TrxA) and protein disulfide isomerase I (DsbA). Preferred tag proteins include His-tag
and GST. A full discussion on the use of tag proteins can be found at Terpe et al., "Overview of tag
protein fusions: from molecular and biochemical fundamentals to commercial systems", Appl
Microbiol Biotechnol (2003) 60:523-533.

After purification, the tag proteins may optionally be removed from the expressed fusion
protein, i.e., by specifically tailored enzymatic treatments known in the art. Commonly used
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor Xa.

GBS polysaccharides 

The compositions of the invention may be further improved by including GBS
polysaccharides. Preferably, the GBS antigen and the saccharide each contribute to the immunological
response in a recipient. The combination is particularly advantageous where the saccharide and
polypeptide provide protection from different GBS serotypes.

The combined antigens may be present as a simple combination where separate saccharide
and polypeptide antigens are administered together, or they may be present as a conjugated
combination, where the saccharide and polypeptide antigens are covalently linked to each other.

Thus the invention provides an immunogenic composition comprising (i) one or more GBS
AI proteins and (ii) one or more GBS saccharide antigens. The polypeptide and the polysaccharide
may advantageously be covalently linked to each other to form a conjugate.

Between them, the combined polypeptide and saccharide antigens preferably cover (or
provide protection from) two or more GBS serotypes (e.g. 2, 3, 4, 5, 6, 7, 8 or more serotypes). The
serotypes of the polypeptide and saccharide antigens may or may not overlap. For example, the
polypeptide might protect against serogroup II or V, while the saccharide protects against either
serogroups Ia, Ib, or III. Preferred combinations protect against the following groups of serotypes:
(1) serotypes Ia and Ib, (2) serotypes Ia and II, (3) serotypes Ia and III, (4) serotypes Ia and IV, (5)
serotypes Ia and V, (6) serotypes Ia and VI, (7) serotypes Ia and VII, (8) serotypes Ia and VIII, (9)
serotypes Ib and II, (10) serotypes Ib and III, (11) serotypes Ib and IV, (12) serotypes Ib and V, (13)
serotypes Ib and VI, (14) serotypes Ib and VII, (15) serotypes Ib and VIII, (16) serotypes II and III,
(17) serotypes II and IV, (18) serotypes II and V, (19) serotypes II and VI, (20) serotypes II and VII,
(21) serotypes II and VIII, (22) serotypes III and IV, (23) serotypes III and V, (24) serotypes III and
VI, (25) serotypes III and VII, (26) serotypes III and VIII, (27) serotypes IV and V, (28) serotypes IV
and VI, (29) serotypes IV and VII, (30) serotypes IV and VIII, (31) serotypes V and VI, (32)
serotypes V and VII, (33) serotypes V and VIII, (34) serotypes VI and VII, (35) serotypes VI and
VIII, and (36) serotypes VII and VIII.
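
The numbered groups above appear to correspond to every unordered pair drawn from nine GBS
serotypes (Ia, Ib, II, III, IV, V, VI, VII and VIII), which gives 9 x 8 / 2 = 36 pairs. The short
Python sketch below regenerates such a list, on the assumption that the enumeration is meant to be
exhaustive and follows the serotype order shown; the names SEROTYPES and serotype_pairs are
illustrative only.

```python
from itertools import combinations

# The nine GBS serotypes named in the text, in the order the list uses.
# (Assumption: the numbered groups are all unordered pairs of these nine.)
SEROTYPES = ["Ia", "Ib", "II", "III", "IV", "V", "VI", "VII", "VIII"]


def serotype_pairs(serotypes=SEROTYPES):
    """Return every unordered pair of distinct serotypes, in list order."""
    return list(combinations(serotypes, 2))


if __name__ == "__main__":
    # Print the pairs in the same "(n) serotypes X and Y" style as the text.
    for number, (first, second) in enumerate(serotype_pairs(), start=1):
        print(f"({number}) serotypes {first} and {second}")
```

Generating the pairs programmatically is a convenient cross-check that no pair is duplicated or
omitted from a hand-written enumeration.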

Still more preferably, the combinations protect against the following groups of serotypes: (1)
serotypes Ia and II, (2) serotypes Ia and V, (3) serotypes Ib and II, (4) serotypes Ib and V, (5)
serotypes III and II, and (6) serotypes III and V. Most preferably, the combinations protect against
serotypes III and V.

Protection against serotypes II and V is preferably provided by polypeptide antigens.

Protection against serotypes Ia, Ib and/or III may be provided by polypeptide or saccharide antigens.

Immunogenic compositions and medicaments 

Compositions of the invention are preferably immunogenic compositions, and are more
preferably vaccine compositions. The pH of the composition is preferably between 6 and 8,
preferably about 7. The pH may be maintained by the use of a buffer. The composition may be
sterile and/or pyrogen-free. The composition may be isotonic with respect to humans.

Vaccines according to the invention may either be prophylactic (i.e. to prevent infection) or
therapeutic (i.e. to treat infection), but will typically be prophylactic. Accordingly, the invention
includes a method for the therapeutic or prophylactic treatment of a Gram positive bacteria infection
in an animal susceptible to such Gram positive bacterial infection comprising administering to said
animal a therapeutic or prophylactic amount of the immunogenic composition of the invention. For
example, the invention includes a method for the therapeutic or prophylactic treatment of a
Streptococcus agalactiae, S. pyogenes, or S. pneumoniae infection in an animal susceptible to
streptococcal infection comprising administering to said animal a therapeutic or prophylactic amount
of the immunogenic compositions of the invention.

The invention also provides the compositions described herein for use as a medicament. The
medicament is preferably able to raise an immune response in a mammal (i.e. it is an immunogenic
composition) and is more preferably a vaccine.

The invention also provides the use of the compositions of the invention in the manufacture of
a medicament for raising an immune response in a mammal. The medicament is preferably a vaccine.

The invention also provides kits comprising one or more containers of compositions of the
invention. Compositions can be in liquid form or can be lyophilized, as can individual antigens.
Suitable containers for the compositions include, for example, bottles, vials, syringes, and test tubes.
Containers can be formed from a variety of materials, including glass or plastic. A container may have
a sterile access port (for example, the container may be an intravenous solution bag or a vial having a
stopper pierceable by a hypodermic injection needle). The composition may comprise a first
component comprising one or more Gram positive bacteria AI proteins. Preferably, the AI proteins
are surface AI proteins. Preferably, the AI surface proteins are in an oligomeric or hyperoligomeric
form. For example, the first component comprises a combination of GBS antigens or GAS antigens,
or S. pneumoniae antigens. Preferably said combination includes GBS 80. Preferably GBS 80 is
present in an oligomeric or hyperoligomeric form.

The kit can further comprise a second container comprising a pharmaceutically-acceptable
buffer, such as phosphate-buffered saline, Ringer's solution, or dextrose solution. It can also contain
other materials useful to the end-user, including other buffers, diluents, filters, needles, and syringes.
The kit can also comprise a second or third container with another active agent, for example an
antibiotic.

The kit can also comprise a package insert containing written instructions for methods of
inducing immunity against S. agalactiae and/or S. pyogenes and/or S. pneumoniae or for treating S.
agalactiae and/or S. pyogenes and/or S. pneumoniae infections. The package insert can be an
unapproved draft package insert or can be a package insert approved by the Food and Drug
Administration (FDA) or other regulatory body.

The invention also provides a delivery device pre-filled with the immunogenic compositions
of the invention.

The invention also provides a method for raising an immune response in a mammal
comprising the step of administering an effective amount of a composition of the invention. The
immune response is preferably protective and preferably involves antibodies and/or cell-mediated
immunity. This immune response will preferably induce long lasting (e.g., neutralising) antibodies
and a cell mediated immunity that can quickly respond upon exposure to one or more GBS and/or
GAS and/or S. pneumoniae antigens. The method may raise a booster response.

The invention provides a method of neutralizing GBS, GAS, or S. pneumoniae infection in a
mammal comprising the step of administering to the mammal an effective amount of the
immunogenic compositions of the invention, a vaccine of the invention, or antibodies which recognize
an immunogenic composition of the invention.

The mammal is preferably a human. Where the vaccine is for prophylactic use, the human is
preferably a female (either of child bearing age or a teenager). Alternatively, the human may be
elderly (e.g., over the age of 50, 55, 60, 65, 70 or 75) and may have an underlying disease such as
diabetes or cancer. Where the vaccine is for therapeutic use, the human is preferably a pregnant
female or an elderly adult.

These uses and methods are preferably for the prevention and/or treatment of a disease caused
by Streptococcus agalactiae, or S. pyogenes, or S. pneumoniae. The compositions may also be
effective against [...]. The compositions may also be effective against other Gram positive bacteria.

One way of checking efficacy of therapeutic treatment involves monitoring Gram positive
bacterial infection after administration of the composition of the invention. One way of checking
efficacy of prophylactic treatment involves monitoring immune responses against the Gram positive
bacterial antigens in the compositions of the invention after administration of the composition.

One way of checking efficacy of therapeutic treatment involves monitoring GBS infection
after administration of the composition of the invention. One way of checking efficacy of
prophylactic treatment involves monitoring immune responses against the GBS antigens in the
compositions of the invention after administration of the composition.

A way of assessing the immunogenicity of the component proteins of the immunogenic
compositions of the present invention is to express the proteins recombinantly and to screen patient
sera or mucosal secretions by immunoblot. A positive reaction between the protein and the patient
serum indicates that the patient has previously mounted an immune response to the protein in
question; that is, the protein is an immunogen. This method may also be used to identify
immunodominant proteins and/or epitopes.

Another way of checking efficacy of therapeutic treatment involves monitoring GBS or GAS
or S. pneumoniae infection after administration of the compositions of the invention. One way of
checking efficacy of prophylactic treatment involves monitoring immune responses both systemically
(such as monitoring the level of IgG1 and IgG2a production) and mucosally (such as monitoring the
level of IgA production) against the GBS and/or GAS and/or S. pneumoniae antigens in the
compositions of the invention after administration of the composition. Typically, GBS and/or GAS
and/or S. pneumoniae serum specific antibody responses are determined post-immunization but
pre-challenge, whereas mucosal GBS and/or GAS and/or S. pneumoniae specific antibody responses
are determined post-immunization and post-challenge.

The vaccine compositions of the present invention can be evaluated in in vitro and in vivo 
animal models prior to host, e.g., human, administration. 

The efficacy of immunogenic compositions of the invention can also be determined in vivo by
challenging animal models of GBS and/or GAS and/or S. pneumoniae infections, e.g., guinea pigs or
mice, with the immunogenic compositions. The immunogenic compositions may or may not be
derived from the same serotypes as the challenge serotypes. Preferably the immunogenic
compositions are derivable from the same serotypes as the challenge serotypes. More preferably, the
immunogenic composition and/or the challenge serotypes are derivable from the group of GBS and/or
GAS and/or S. pneumoniae serotypes.

In vivo efficacy models include but are not limited to: (i) a murine infection model using
human GBS and/or GAS and/or S. pneumoniae serotypes; (ii) a murine disease model which is a
murine model using a mouse-adapted GBS and/or GAS and/or S. pneumoniae strain, such as those
[...] virulent in mice; and (iii) a primate model using human GBS or GAS or S. pneumoniae isolates.

The immune response may be one or both of a TH1 immune response and a TH2 response.

The immune response may be an improved or an enhanced or an altered immune response.

The immune response may be one or both of a systemic and a mucosal immune response.

Preferably the immune response is an enhanced systemic and/or mucosal response.

An enhanced systemic and/or mucosal immunity is reflected in an enhanced TH1 and/or TH2
immune response. Preferably, the enhanced immune response includes an increase in the production
of IgG1 and/or IgG2a and/or IgA.

Preferably the mucosal immune response is a TH2 immune response. Preferably, the mucosal
immune response includes an increase in the production of IgA.

Activated TH2 cells enhance antibody production and are therefore of value in responding to
extracellular infections. Activated TH2 cells may secrete one or more of IL-4, IL-5, IL-6, and IL-10.
A TH2 immune response may result in the production of IgG1, IgE, IgA and memory B cells for
future protection.

A TH2 immune response may include one or more of an increase in one or more of the
cytokines associated with a TH2 immune response (such as IL-4, IL-5, IL-6 and IL-10), or an increase
in the production of IgG1, IgE, IgA and memory B cells. Preferably, the enhanced TH2 immune
response will include an increase in IgG1 production.

A TH1 immune response may include one or more of an increase in CTLs, an increase in one
or more of the cytokines associated with a TH1 immune response (such as IL-2, IFNγ, and TNFβ), an
increase in activated macrophages, an increase in NK activity, or an increase in the production of
IgG2a. Preferably, the enhanced TH1 immune response will include an increase in IgG2a production.

Immunogenic compositions of the invention, in particular immunogenic compositions
comprising one or more GAS antigens of the present invention, may be used either alone or in
combination with other GAS antigens, optionally with an immunoregulatory agent capable of eliciting
a Th1 and/or Th2 response.

Compositions of the invention will generally be administered directly to a patient. Certain
routes may be favored for certain compositions, as resulting in the generation of a more effective
immune response, preferably a CMI response, or as being less likely to induce side effects, or as being
easier for administration. Direct delivery may be accomplished by parenteral injection (e.g.
subcutaneously, intraperitoneally, intradermally, intravenously, intramuscularly, or to the interstitial
space of a tissue), or by rectal, oral (e.g. tablet, spray), vaginal, topical, transdermal (e.g. see WO
99/27961) or transcutaneous (e.g. see WO 02/074244 and WO 02/064162), intranasal (e.g. see
WO 03/028760), ocular, aural, pulmonary or other mucosal administration.

The invention may be used to elicit systemic and/or mucosal immunity.

In one particularly preferred embodiment, the immunogenic composition comprises one or
more GBS or GAS or S. pneumoniae antigen(s) which elicit a neutralising antibody response and one
or more GBS or GAS or S. pneumoniae antigen(s) which elicit a cell mediated immune response. In
this way, the neutralising antibody response prevents or inhibits an initial GBS or GAS or S.
pneumoniae infection while the cell-mediated immune response capable of eliciting an enhanced Th1
cellular response prevents further spreading of the GBS or GAS or S. pneumoniae infection.
Preferably, the immunogenic composition comprises one or more GBS or GAS or S. pneumoniae
surface antigens and one or more GBS or GAS or S. pneumoniae cytoplasmic antigens. Preferably the
immunogenic composition comprises one or more GBS or GAS or S. pneumoniae surface antigens or
the like and one or more other antigens, such as a cytoplasmic antigen capable of eliciting a Th1 cellular
response.

Dosage treatment can be a single dose schedule or a multiple dose schedule. Multiple doses
may be used in a primary immunisation schedule and/or in a booster immunisation schedule. In a
multiple dose schedule the various doses may be given by the same or different routes e.g. a
parenteral prime and mucosal boost, a mucosal prime and parenteral boost, etc.

The compositions of the invention may be prepared in various forms. For example, the
compositions may be prepared as injectables, either as liquid solutions or suspensions. Solid forms
suitable for solution in, or suspension in, liquid vehicles prior to injection can also be prepared (e.g. a
lyophilised composition). The composition may be prepared for topical administration e.g. as an
ointment, cream or powder. The composition may be prepared for oral administration e.g. as a tablet
or capsule, as a spray, or as a syrup (optionally flavoured). The composition may be prepared for
pulmonary administration e.g. as an inhaler, using a fine powder or a spray. The composition may be
prepared as a suppository or pessary. The composition may be prepared for nasal, aural or ocular
administration e.g. as drops. The composition may be in kit form, designed such that a combined
composition is reconstituted just prior to administration to a patient. Such kits may comprise one or
more antigens in liquid form and one or more lyophilised antigens.

Immunogenic compositions used as vaccines comprise an immunologically effective amount
of antigen(s), as well as any other components, such as antibiotics, as needed. By 'immunologically
effective amount', it is meant that the administration of that amount to an individual, either in a single
dose or as part of a series, is effective for treatment or prevention, or increases a measurable immune
response or prevents or reduces a clinical symptom. This amount varies depending upon the health
and physical condition of the individual to be treated, age, the taxonomic group of individual to be
treated (e.g. non-human primate, primate, etc.), the capacity of the individual's immune system to
synthesise antibodies, the degree of protection desired, the formulation of the vaccine, the treating
doctor's assessment of the medical situation, and other relevant factors. It is expected that the amount
will fall in a relatively broad range that can be determined through routine trials.

The composition of the invention will typically, in addition to the components mentioned
above, comprise one or more 'pharmaceutically acceptable carriers', which include any carrier that
does not itself induce the production of antibodies harmful to the individual receiving the
composition. Suitable carriers are typically large, slowly metabolised macromolecules such as
proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid
copolymers, and lipid aggregates (such as oil droplets or liposomes). Such carriers are well known to
those of ordinary skill in the art. The vaccines may also contain diluents, such as water, saline,
glycerol, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering
substances, and the like, may be present. A thorough discussion of pharmaceutically acceptable
excipients is available in Gennaro (2000) Remington: The Science and Practice of Pharmacy. 20th
ed., ISBN: 0683306472.

Adjuvants 

Vaccines of the invention may be administered in conjunction with other immunoregulatory
agents. In particular, compositions will usually include an adjuvant. Adjuvants for use with the
invention include, but are not limited to, one or more of those set forth below:

A. Mineral Containing Compositions

Mineral containing compositions suitable for use as adjuvants in the invention include
mineral salts, such as aluminum salts and calcium salts. The invention includes mineral salts such as
hydroxides (e.g. oxyhydroxides), phosphates (e.g. hydroxyphosphates, orthophosphates), sulfates, etc.
(e.g. see chapters 8 & 9 of Vaccine Design... (1995) eds. Powell & Newman. ISBN: 030644867X.
Plenum.), or mixtures of different mineral compounds (e.g. a mixture of a phosphate and a hydroxide
adjuvant, optionally with an excess of the phosphate), with the compounds taking any suitable form
(e.g. gel, crystalline, amorphous, etc.), and with adsorption to the salt(s) being preferred. The mineral
containing compositions may also be formulated as a particle of metal salt (WO 00/23105).

Aluminum salts may be included in vaccines of the invention such that the dose of Al3+ is
between 0.2 and 1.0 mg per dose.

B. Oil-Emulsions

Oil-emulsion compositions suitable for use as adjuvants in the invention include squalene-water
emulsions, such as MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated into
submicron particles using a microfluidizer). See WO 90/14837. See also, Podda, "The adjuvanted
influenza vaccines with novel adjuvants: experience with the MF59-adjuvanted vaccine", Vaccine
(2001) 19:2673-2680; Frey et al., "Comparison of the safety, tolerability, and immunogenicity of a
MF59-adjuvanted influenza vaccine and a non-adjuvanted influenza vaccine in non-elderly adults",
Vaccine (2003) 21:4234-4237. MF59 is used as the adjuvant in the FLUAD™ influenza virus
trivalent subunit vaccine.

Particularly preferred adjuvants for use in the compositions are submicron oil-in-water
emulsions. Preferred submicron oil-in-water emulsions for use herein are squalene/water emulsions
optionally containing varying amounts of MTP-PE, such as a submicron oil-in-water emulsion
containing 4-5% w/v squalene, 0.25-1.0% w/v Tween 80™ (polyoxyethylenesorbitan monooleate),
and/or 0.25-1.0% Span 85™ (sorbitan trioleate), and, optionally,
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(MTP-PE), for example, the submicron oil-in-water emulsion known as "MF59" (International
Publication No. WO 90/14837; US Patent Nos. 6,299,884 and 6,451,325, incorporated herein by
reference in their entireties; and Ott et al., "MF59 - Design and Evaluation of a Safe and Potent
Adjuvant for Human Vaccines" in Vaccine Design: The Subunit and Adjuvant Approach (Powell,
M.F. and Newman, M.J. eds.) Plenum Press, New York, 1995, pp. 277-296). MF59 contains 4-5%
w/v Squalene (e.g. 4.3%), 0.25-0.5% w/v Tween 80™, and 0.5% w/v Span 85™ and optionally
contains various amounts of MTP-PE, formulated into submicron particles using a microfluidizer such
as Model 110Y microfluidizer (Microfluidics, Newton, MA). For example, MTP-PE may be present
in an amount of about 0-500 µg/dose, more preferably 0-250 µg/dose and most preferably, 0-100
µg/dose. As used herein, the term "MF59-0" refers to the above submicron oil-in-water emulsion
lacking MTP-PE, while the term MF59-MTP denotes a formulation that contains MTP-PE. For
instance, "MF59-100" contains 100 µg MTP-PE per dose, and so on. MF69, another submicron
oil-in-water emulsion for use herein, contains 4.3% w/v squalene, 0.25% w/v Tween 80™, and 0.75% w/v
Span 85™ and optionally MTP-PE. Yet another submicron oil-in-water emulsion is MF75, also
known as SAF, containing 10% squalene, 0.4% Tween 80™, 5% pluronic-blocked polymer L121, and
thr-MDP, also microfluidized into a submicron emulsion. MF75-MTP denotes an MF75 formulation
that includes MTP, such as from 100-400 µg MTP-PE per dose.

Submicron oil-in-water emulsions, methods of making the same and immunostimulating
agents, such as muramyl peptides, for use in the compositions, are described in detail in International
Publication No. WO 90/14837 and US Patent Nos. 6,299,884 and 6,451,325, incorporated herein by
reference in their entireties.

Complete Freund's adjuvant (CFA) and incomplete Freund's adjuvant (IFA) may also be
used as adjuvants in the invention.

C. Saponin Formulations

Saponin formulations may also be used as adjuvants in the invention. Saponins are a
heterologous group of sterol glycosides and triterpenoid glycosides that are found in the bark, leaves,
stems, roots and even flowers of a wide range of plant species. Saponins from the bark of the Quillaia
saponaria Molina tree have been widely studied as adjuvants. Saponin can also be commercially
obtained from Smilax ornata (sarsaparilla), Gypsophila paniculata (brides veil), and Saponaria
officinalis (soap root). Saponin adjuvant formulations include purified formulations, such as QS21,
as well as lipid formulations, such as ISCOMs.

Saponin compositions have been purified using High Performance Thin Layer
Chromatography (HP-TLC) and Reversed Phase High Performance Liquid Chromatography
(RP-HPLC). Specific purified fractions using these techniques have been identified, including QS7, QS17,
QS18, QS21, QH-A, QH-B and QH-C. Preferably, the saponin is QS21. A method of production of
QS21 is disclosed in US Patent No. 5,057,540. Saponin formulations may also comprise a sterol, such
as cholesterol (see WO 96/33739).

Combinations of saponins and cholesterols can be used to form unique particles called
Immunostimulating Complexes (ISCOMs). ISCOMs typically also include a phospholipid such as
phosphatidylethanolamine or phosphatidylcholine. Any known saponin can be used in ISCOMs.
Preferably, the ISCOM includes one or more of Quil A, QHA and QHC. ISCOMs are further
described in EP0109942, WO 96/11711 and WO 96/33739. Optionally, the ISCOMs may be devoid
of additional detergent. See WO 00/07621.

A review of the development of saponin based adjuvants can be found at Barr, et al., 
"ISCOMs and other saponin based adjuvants", Advanced Drug Delivery Reviews (1998) 32:247-271. 
See also Sjolander, et al., "Uptake and adjuvant activity of orally delivered saponin and ISCOM 
vaccines", Advanced Drug Delivery Reviews (1998) 32:321-338. 

D. Virosomes and Virus Like Particles (VLPs) 

Virosomes and Virus Like Particles (VLPs) can also be used as adjuvants in the invention.
These structures generally contain one or more proteins from a virus optionally combined or
formulated with a phospholipid. They are generally non-pathogenic, non-replicating and generally do
not contain any of the native viral genome. The viral proteins may be recombinantly produced or
isolated from whole viruses. These viral proteins suitable for use in virosomes or VLPs include
proteins derived from influenza virus (such as HA or NA), Hepatitis B virus (such as core or capsid
proteins), Hepatitis E virus, measles virus, Sindbis virus, Rotavirus, Foot-and-Mouth Disease virus,
Retrovirus, Norwalk virus, human Papilloma virus, HIV, RNA-phages, Qβ-phage (such as coat
proteins), GA-phage, fr-phage, AP205 phage, and Ty (such as retrotransposon Ty protein p1). VLPs
are discussed further in WO 03/024480, WO 03/024481, and Niikura et al., "Chimeric Recombinant
Hepatitis E Virus-Like Particles as an Oral Vaccine Vehicle Presenting Foreign Epitopes", Virology
(2002) 293:273-280; Lenz et al., "Papillomavirus-Like Particles Induce Acute Activation of
Dendritic Cells", Journal of Immunology (2001) 5246-5355; Pinto, et al., "Cellular Immune
Responses to Human Papillomavirus (HPV)-16 L1 in Healthy Volunteers Immunized with Recombinant
HPV-16 L1 Virus-Like Particles", Journal of Infectious Diseases (2003) 188:327-338; and Gerber et
al., "Human Papillomavirus Virus-Like Particles Are Efficient Oral Immunogens when
Coadministered with Escherichia coli Heat-Labile Enterotoxin Mutant R192G or CpG", Journal of
Virology (2001) 75(10):4752-4760. Virosomes are discussed further in, for example, Gluck et al.,
"New Technology Platforms in the Development of Vaccines for the Future", Vaccine (2002)
20:B10-B16. Immunopotentiating reconstituted influenza virosomes (IRIV) are used as the subunit antigen
[...] product {Mischler & Metcalfe (2002) Vaccine 20 Suppl 5:B17-23} and the INFLUVAC PLUS™
product.

E. Bacterial or Microbial Derivatives

Adjuvants suitable for use in the invention include bacterial or microbial derivatives such as: 

(1) Non-toxic derivatives of enterobacterial lipopolysaccharide (LPS) 

Such derivatives include Monophosphoryl lipid A (MPL) and 3-O-deacylated MPL (3dMPL). 
3dMPL is a mixture of 3 de-O-acylated monophosphoryl lipid A with 4, 5 or 6 acylated chains. A 
preferred "small particle" form of 3 de-O-acylated monophosphoryl lipid A is disclosed in EP 0 689 
454. Such "small particles" of 3dMPL are small enough to be sterile filtered through a 0.22 micron 
membrane (see EP 0 689 454). Other non-toxic LPS derivatives include monophosphoryl lipid A 
mimics, such as aminoalkyl glucosaminide phosphate derivatives, e.g. RC-529. See Johnson et al. 
(1999) Bioorg Med Chem Lett 9:2273-2278. 

(2) Lipid A Derivatives 

Lipid A derivatives include derivatives of lipid A from Escherichia coli such as OM-174. 
OM-174 is described for example in Meraldi et al., "OM-174, a New Adjuvant with a Potential for 
Human Use, Induces a Protective Response when Administered with the Synthetic C-Terminal 
Fragment 242-310 from the circumsporozoite protein of Plasmodium berghei", Vaccine (2003) 
21:2485-2491; and Pajak, et al., "The Adjuvant OM-174 induces both the migration and maturation of 
murine dendritic cells in vivo", Vaccine (2003) 21:836-842. 

(3) Immunostimulatory oligonucleotides 

Immunostimulatory oligonucleotides suitable for use as adjuvants in the invention include 
nucleotide sequences containing a CpG motif (a sequence containing an unmethylated cytosine 
followed by guanosine and linked by a phosphate bond). Bacterial double stranded RNA or 
oligonucleotides containing palindromic or poly(dG) sequences have also been shown to be 
immunostimulatory. 

The CpG's can include nucleotide modifications/analogs such as phosphorothioate 
modifications and can be double-stranded or single-stranded. Optionally, the guanosine may be 
replaced with an analog such as 2'-deoxy-7-deazaguanosine. See Kandimalla, et al., "Divergent 
synthetic nucleotide motif recognition pattern: design and development of potent immunomodulatory 
oligodeoxyribonucleotide agents with distinct cytokine induction profiles", Nucleic Acids Research 
(2003) 31(9): 2393-2400; WO02/26757 and WO99/62923 for examples of possible analog 
substitutions. The adjuvant effect of CpG oligonucleotides is further discussed in Krieg, "CpG motifs: 
the active ingredient in bacterial extracts?", Nature Medicine (2003) 9(7): 831-835; McCluskie, et al., 
"Parenteral and mucosal prime-boost immunization strategies in mice with hepatitis B surface antigen 
and CpG DNA", FEMS Immunology and Medical Microbiology (2002) 32:179-185; WO98/40100; 
US Patent No. 6,207,646; US Patent No. 6,239,116 and US Patent No. 6,429,199. 

The CpG sequence may be directed to TLR9, such as the motif GTCGTT or TTCGTT. See 
Kandimalla, et al., "Toll-like receptor 9: modulation of recognition and cytokine induction by novel 
synthetic CpG DNAs", Biochemical Society Transactions (2003) 31 (part 3): 654-658. The CpG 
sequence may be specific for inducing a Th1 immune response, such as a CpG-A ODN, or it may be 
more specific for inducing a B cell response, such as a CpG-B ODN. CpG-A and CpG-B ODNs are 
discussed in Blackwell, et al., "CpG-A-Induced Monocyte IFN-gamma-Inducible Protein-10 
Production is Regulated by Plasmacytoid Dendritic Cell Derived IFN-alpha", J. Immunol. (2003) 
170(8):4061-4068; Krieg, "From A to Z on CpG", TRENDS in Immunology (2002) 23(2): 64-65 and 
WO01/95935. Preferably, the CpG is a CpG-A ODN. 

Preferably, the CpG oligonucleotide is constructed so that the 5' end is accessible for receptor 
recognition. Optionally, two CpG oligonucleotide sequences may be attached at their 3' ends to form 
"immunomers". See, for example, Kandimalla, et al., "Secondary structures in CpG oligonucleotides 
affect immunostimulatory activity", BBRC (2003) 306:948-953; Kandimalla, et al., "Toll-like 
receptor 9: modulation of recognition and cytokine induction by novel synthetic CpG DNAs", 
Biochemical Society Transactions (2003) 31(part 3):654-658; Bhagat et al., "CpG penta- and 
hexadeoxyribonucleotides as potent immunomodulatory agents", BBRC (2003) 300:853-861 and WO 
03/035836. 

(4) ADP-ribosylating toxins and detoxified derivatives thereof. 

Bacterial ADP-ribosylating toxins and detoxified derivatives thereof may be used as 
adjuvants in the invention. Preferably, the protein is derived from E. coli (i.e., E. coli heat labile 
enterotoxin "LT"), cholera ("CT"), or pertussis ("PT"). The use of detoxified ADP-ribosylating toxins 
as mucosal adjuvants is described in WO95/17211 and as parenteral adjuvants in WO98/42375. 
Preferably, the adjuvant is a detoxified LT mutant such as LT-K63, LT-R72, or LT-R192G. The use 
of ADP-ribosylating toxins and detoxified derivatives thereof, particularly LT-K63 and LT-R72, as 
adjuvants is described in the following references, each of which is specifically incorporated by 
reference herein in its entirety: Beignon, et al., "The LTR72 Mutant of Heat-Labile Enterotoxin of 
Escherichia coli Enhances the Ability of Peptide Antigens to Elicit CD4+ T Cells and Secrete Gamma 
Interferon after Coapplication onto Bare Skin", Infection and Immunity (2002) 70(6):3012-3019; 
Pizza, et al., "Mucosal vaccines: non toxic derivatives of LT and CT as mucosal adjuvants", Vaccine 
(2001) 19:2534-2541; Pizza, et al., "LTK63 and LTR72, two mucosal adjuvants ready for clinical 
trials", Int. J. Med. Microbiol (2000) 290(4-5):455-461; Scharton-Kersten et al., "Transcutaneous 
Immunization with Bacterial ADP-Ribosylating Exotoxins, Subunits and Unrelated Adjuvants", 
Infection and Immunity (2000) 68(9):5306-5313; Ryan et al., "Mutants of Escherichia coli Heat-Labile 
Toxin Act as Effective Mucosal Adjuvants for Nasal Delivery of an Acellular Pertussis 
Vaccine: Differential Effects of the Nontoxic AB Complex and Enzyme Activity on Th1 and Th2 
Cells", Infection and Immunity (1999) 67(12):6270-6280; Partidos et al., "Heat-labile enterotoxin of 
Escherichia coli and its site-directed mutant LTK63 enhance the proliferative and cytotoxic T-cell 
responses to intranasally co-immunized synthetic peptides", Immunol. Lett. (1999) 67(3):209-216; 
Peppoloni et al., "Mutants of the Escherichia coli heat-labile enterotoxin as safe and strong adjuvants 
for intranasal delivery of vaccines", Vaccines (2003) 2(2):285-293; and Pine et al., (2002) "Intranasal 
[...] Escherichia coli (LTK63)", J. Control Release (2002) 85(1-3):263-270. Numerical reference for amino 
acid substitutions is preferably based on the alignments of the A and B subunits of ADP-ribosylating 
toxins set forth in Domenighini et al., Mol. Microbiol (1995) 15(6):1165-1167, specifically 
incorporated herein by reference in its entirety. 

F. Bioadhesives and Mucoadhesives 

Bioadhesives and mucoadhesives may also be used as adjuvants in the invention. Suitable 
bioadhesives include esterified hyaluronic acid microspheres (Singh et al. (2001) J. Cont. Rele. 
70:267-276) or mucoadhesives such as cross-linked derivatives of poly(acrylic acid), polyvinyl 
alcohol, polyvinyl pyrrolidone, polysaccharides and carboxymethylcellulose. Chitosan and derivatives 
thereof may also be used as adjuvants in the invention (see, e.g., WO99/27960). 

G. Microparticles 

Microparticles may also be used as adjuvants in the invention. Microparticles (i.e. a particle 
of ~100 nm to ~150 µm in diameter, more preferably ~200 nm to ~30 µm in diameter, and most 
preferably ~500 nm to ~10 µm in diameter) formed from materials that are biodegradable and 
non-toxic (e.g. a poly(α-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a polyanhydride, 
a polycaprolactone, etc.), such as poly(lactide-co-glycolide), are preferred, optionally treated to have a 
negatively-charged surface (e.g. with SDS) or a positively-charged surface (e.g. with a cationic 
detergent, such as CTAB). 

H. Liposomes 

Examples of liposome formulations suitable for use as adjuvants are described in US Patent 
No. 6,090,406, US Patent No. 5,916,588, and EP 0 626 169. 

I. Polyoxyethylene Ether and Polyoxyethylene Ester Formulations 

Adjuvants suitable for use in the invention include polyoxyethylene ethers and 
polyoxyethylene esters (WO99/52549). Such formulations further include polyoxyethylene sorbitan 
ester surfactants in combination with an octoxynol (WO 01/21207) as well as polyoxyethylene alkyl 
ethers or ester surfactants in combination with at least one additional non-ionic surfactant such as an 
octoxynol (WO 01/21152). 

Preferred polyoxyethylene ethers are selected from the following group: polyoxyethylene-9-lauryl 
ether (laureth 9), polyoxyethylene-9-steoryl ether, polyoxyethylene-8-steoryl ether, 
polyoxyethylene-4-lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl 
ether. 

J. Polyphosphazene (PCPP) 

PCPP formulations are described, for example, in Andrianov et al., "Preparation of hydrogel 
microspheres by coacervation of aqueous polyphosphazene solutions", Biomaterials (1998) 
19(1-3):109-115 and Payne et al., "Protein Release from Polyphosphazene Matrices", Adv. Drug. Delivery 
Review (1998) 31(3):185-196. 

Examples of muramyl peptides suitable for use as adjuvants in the invention include 
N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine 
(nor-MDP), and N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-
hydroxyphosphoryloxy)-ethylamine (MTP-PE). 

L. Imidazoquinolone Compounds. 

Examples of imidazoquinolone compounds suitable for use as adjuvants in the invention include 
Imiquimod and its homologues, described further in Stanley, "Imiquimod and the imidazoquinolones: 
mechanism of action and therapeutic potential", Clin Exp Dermatol (2002) 27(7):571-577 and Jones, 
"Resiquimod 3M", Curr Opin Investig Drugs (2003) 4(2):214-218. 

The invention may also comprise combinations of aspects of one or more of the adjuvants 
identified above. For example, the following adjuvant compositions may be used in the invention: 

(1) a saponin and an oil-in-water emulsion (WO 99/11241); 

(2) a saponin (e.g. QS21) + a non-toxic LPS derivative (e.g. 3dMPL) (see WO 94/00153); 

(3) a saponin (e.g. QS21) + a non-toxic LPS derivative (e.g. 3dMPL) + a cholesterol; 

(4) a saponin (e.g. QS21) + 3dMPL + IL-12 (optionally + a sterol) (WO 98/57659); 

(5) combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions (see 
European patent applications 0835318, 0735898 and 0761231); 

(6) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-block polymer L121, and 
thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger particle size 
emulsion; 

(7) Ribi™ adjuvant system (RAS) (Ribi Immunochem), containing 2% Squalene, 0.2% Tween 
80, and one or more bacterial cell wall components from the group consisting of monophosphoryl lipid 
A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS 
(Detox™); 

(8) one or more mineral salts (such as an aluminum salt) + a non-toxic derivative of LPS (such as 
3dMPL); 

(9) one or more mineral salts (such as an aluminum salt) + an immunostimulatory oligonucleotide 
(such as a nucleotide sequence including a CpG motif). Combination No. (9) is a preferred adjuvant 
combination. 

M. Human Immunomodulators 

Human immunomodulators suitable for use as adjuvants in the invention include cytokines, 
such as interleukins (e.g. IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), interferons (e.g. interferon-γ), 
macrophage colony stimulating factor, and tumor necrosis factor. 

Aluminum salts and MF59 are preferred adjuvants for use with injectable influenza vaccines. 
Bacterial toxins and bioadhesives are preferred adjuvants for use with mucosally-delivered vaccines, 
such as nasal vaccines. 

[...] with an antibiotic treatment regime. In one embodiment, the antibiotic is administered prior to 
administration of the antigen of the invention or the composition comprising the one or more of the 
antigens of the invention. 

In another embodiment, the antibiotic is administered subsequent to the administration of the 
one or more antigens of the invention or the composition comprising the one or more antigens of the 
invention. Examples of antibiotics suitable for use in the treatment of the Streptococcal infections of 
the invention include but are not limited to penicillin or a derivative thereof or clindamycin or the 
like. 

Further antigens 

The compositions of the invention may further comprise one or more additional Gram 
positive bacterial antigens which are not associated with an AI. Preferably, the Gram positive 
bacterial antigens that are not associated with an AI can provide protection across more than one 
serotype or strain isolate. For example, a first non-AI antigen, in which the first non-AI antigen is at 
least 90% (i.e., at least 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100%) homologous to the amino acid 
sequence of a second non-AI antigen, wherein the first and the second non-AI antigen are derived 
from the genomes of different serotypes of a Gram positive bacterium, may be further included in the 
compositions. The first non-AI antigen may also be homologous to the amino acid sequence of a 
third non-AI antigen, such that the first non-AI antigen, the second non-AI antigen, and the third 
non-AI antigen are derived from the genomes of different serotypes of a Gram positive bacterium. The first 
non-AI antigen may also be homologous to the amino acid sequence of a fourth non-AI antigen, such 
that the first non-AI antigen, the second non-AI antigen, the third non-AI antigen, and the fourth 
non-AI antigen are derived from the genomes of different serotypes of a Gram positive bacterium. 

The first non-AI antigen may be GBS 322. The amino acid sequence identity of GBS 322 across GBS 
strains from serotypes Ia, Ib, II, III, V, and VIII is greater than 90%. Alternatively, the first non-AI 
antigen may be GBS 276. The amino acid sequence identity of GBS 276 across GBS strains from serotypes Ia, 
Ib, II, III, V, and VIII is greater than 90%. Table 13 provides the percent amino acid sequence 
identity of GBS 322 and GBS 276 across different GBS strains and serotypes. 



Table 13: Conservation of GBS 322 and GBS 276 amino acid sequences 

                         GBS 322                  GBS 276
Serotype   Strains       cGH    %AA identity      cGH    %AA identity
Ia         090                  98.60                    97.90
           A909          +      98.30             +      97.90
           515           +      98.80             +      97.50
           DK1                                    +
           DK8           +                        +
           Davis                                  +
Ib         7357b         +
           H36B                 98.30                    97.80
II         18RS21        +      100.00            +      99.90
           DK21                                   +
III        NEM316        +      100.00            +      97.00
           COH31         +                        +
           D136          +                        +
           M732          +      98.00             +      100.00
           COH1          +      98.30             +      100.00
           M781          +      98.30             +      99.60
No type    CJB110        +      98.60             +      97.90
           1169NT        +      97.40             +      97.90
V          CJB111        +      100.00            +
           2603          +      100.00            +      100.00
VIII       JM9130013     +      100.00                   97.90
           SMU014                                 +
Total                    22/22  98.28 +/- 0.4     22/22  98.44 +/- 1.094



As an example, inclusion of a non-AI protein, GBS 322, in combination with AI antigens 
GBS 67, GBS 80, and GBS 104 provided protection to newborn mice in an active maternal 
immunization assay. 

Table 14: Active maternal immunization assay for a combination of fragments from GBS 322, GBS 80, GBS 104, and GBS 67 

GBS strains   Type   GBS 80   GBS 67        GBS 322   alive/treated   % protection   alive/treated   % protection
515           Ia     0        409           227       39/40           97             6/40            15
7357b         Ib     91       316           102       19/30           63             1/30
DK21          II     0        331           416       25/34           73             17/48           35
5401          II     170      618           135       35/40           87             3/37            8
3050          II     43       460           188       48/48           100            1/30            3
[illegible]   III    305      0             130       36/36           100            7/40            17
M781          III    65       [illegible]   224       30/40           75             [illegible]     10
2603          V      125      105           313       27/33           82             10/35           28
CJB111        V      370      481           63        25/28           89             4/46            9
JM9130013     VIII   597      83            143       37/39           95             5/40            12
JMU071        VIII   556      79            170       44/50           88             18/50           36
1169NT        NT     0        443           213       12/32           37             11/35           31



In fact, the non-AI GBS 322 antigen may itself provide protection to newborn mice in an 
active maternal immunization assay. 

Table 16: Active maternal immunization assay for each of GBS 80 and GBS 322 antigens 

                   GBS 80                           GBS 322
                   FACS     Protection (% survival) FACS     Protection (% survival)
GBS strains  Type  Δ Mean   antigen     ctrl-       Δ Mean   antigen     ctrl-
CJB111       V     370      72%         40%         63       57%         40%
COH1         III   305      76%                     130      3%          10%
2603         V     82       22%         34%         313      83%         34%
7357b        Ib    91       36%         34%         102      43%         34%
18RS21       II    0        15%         24%         268      84%         24%
DK21         II    0        10%         21%         416      67%         25%
A909         Ia    0        0%          14%
090                0        0%          0%
H36B                                                105      34%         32%



Thus, inclusion of a non-AI protein in an immunogenic composition of the invention may provide 
increased protection to a mammal. 
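
The "% protection" and "% survival" figures reported for the maternal immunization assays appear to be the share of challenged pups that were alive at the end of the assay. A minimal Python sketch of that arithmetic follows; the function name `survival_percent` is hypothetical, and because the text does not state how the tabulated values were rounded, the sketch returns an unrounded value.

```python
def survival_percent(alive: int, treated: int) -> float:
    """Return the percentage of treated (challenged) pups that survived.

    Assumption: protection is simply alive / treated * 100, with no
    correction for the control group and no rounding applied.
    """
    if treated <= 0:
        raise ValueError("treated must be a positive count")
    if not 0 <= alive <= treated:
        raise ValueError("alive must be between 0 and treated")
    return 100.0 * alive / treated


if __name__ == "__main__":
    # Illustrative counts written in the same alive/treated form as the tables.
    for alive, treated in [(48, 48), (6, 40)]:
        print(f"{alive}/{treated} -> {survival_percent(alive, treated):.1f}%")
```

Comparing the immunized and control groups side by side, as the tables do, shows protection that exceeds background survival.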

The immunogenic compositions comprising S. pneumoniae AI polypeptides may further comprise 
secondary SP protein antigens which include (1) any of the SP protein antigens disclosed in WO 
02/077021 or U.S. provisional application , filed April 20, 2005 (Attorney Docket 
Number 00244 LOO 154), (2) immunogenic portions of the antigens comprising at least 7 
contiguous amino acids, (3) proteins comprising amino acid sequences which retain 
immunogenicity and which are at least 95% identical to these SP protein antigens (e.g., 95%, 
96%, 97%, 98%, 99%, or 99.5% identical), and (4) fusion proteins, including hybrid SP protein 
antigens, comprising (1)-(3). 

Alternatively, the invention may include an immunogenic composition comprising a first and 
a second Gram positive bacteria non-AI protein, wherein the polynucleotide sequence encoding the 
sequence of the first non-AI protein is less than 90% (i.e., less than 90, 88, 86, 84, 82, 81, 78, 76, 74, 
72, 70, 65, 60, 55, 50, 45, 40, 35, or 30 percent) homologous to the corresponding sequence in the 
genome of the second non-AI protein. 

The compositions of the invention may further comprise one or more additional non-Gram 
positive bacterial antigens, including additional bacterial, viral or parasitic antigens. The 
compositions of the invention may further comprise one or more additional non-GBS antigens, 
including additional bacterial, viral or parasitic antigens. 

In another embodiment, the GBS antigen combinations of the invention are combined with 
one or more additional, non-GBS antigens suitable for use in a vaccine designed to protect elderly or 
immunocompromised individuals. For example, the GBS antigen combinations may be combined with 
an antigen derived from the group consisting of Enterococcus faecalis, Staphylococcus aureus, 
Staphylococcus epidermidis, Pseudomonas aeruginosa, Legionella pneumophila, Listeria 
monocytogenes, Neisseria meningitidis, influenza, and Parainfluenza virus ('PIV'). 

Where a saccharide or carbohydrate antigen is used, it is preferably conjugated to a carrier 
protein in order to enhance immunogenicity {e.g. Ramsay et al. (2001) Lancet 357(9251):195-196; 
Lindberg (1999) Vaccine 17 Suppl 2:S28-36; Buttery & Moxon (2000) J R Coll Physicians Lond 
34:163-168; Ahmad & Chapnick (1999) Infect Dis Clin North Am 13:113-133, vii.; Goldblatt (1998) 
J. Med. Microbiol 47:563-567; European patent 0 477 508; US Patent No. 5,306,492; International 
patent application WO98/42721; Conjugate Vaccines (eds. Cruse et al.) ISBN 3805549326, 
particularly vol. 10:48-114; and Hermanson (1996) Bioconjugate Techniques ISBN: 0123423368 or 
012342335X}. Preferred carrier proteins are bacterial toxins or toxoids, such as diphtheria or tetanus 
toxoids. The CRM197 diphtheria toxoid is particularly preferred {Research Disclosure, 453077 (Jan 
2002)}. Other carrier polypeptides include the N. meningitidis outer membrane protein 
(EP-A-0372501), synthetic peptides (EP-A-0378881; EP-A-0427347), heat shock proteins (WO 93/17712; 
WO 94/03208), pertussis proteins (WO 98/58668; EP-A-0471177), protein D from H. influenzae (WO 
00/56360), cytokines (WO 91/01146), lymphokines, hormones, growth factors, toxin A or B from 
C. difficile (WO00/61761), iron-uptake proteins (WO01/72337), etc. Where a mixture comprises 
capsular saccharides from both serogroups A and C, it may be preferred that the ratio (w/w) of MenA 
saccharide:MenC saccharide is greater than 1 (e.g. 2:1, 3:1, 4:1, 5:1, 10:1 or higher). Different 
saccharides can be conjugated to the same or different type of carrier protein. Any suitable 
conjugation reaction can be used, with any suitable linker where necessary. 

Toxic protein antigens may be detoxified where necessary e.g. detoxification of pertussis 
toxin by chemical and/or genetic means. 

Where a diphtheria antigen is included in the composition it is preferred also to include 
tetanus antigen and pertussis antigens. Similarly, where a tetanus antigen is included it is preferred 
also to include diphtheria and pertussis antigens. Similarly, where a pertussis antigen is included it is 
preferred also to include diphtheria and tetanus antigens. 

Antigens in the composition will typically be present at a concentration of at least 1 µg/ml 
each. In general, the concentration of any given antigen will be sufficient to elicit an immune 
response against that antigen. 

As an alternative to using protein antigens in the composition of the invention, nucleic acid 
encoding the antigen may be used {e.g. refs. Robinson & Torres (1997) Seminars in Immunology 
9:271-283; Donnelly et al. (1997) Annu Rev Immunol 15:617-648; Scott-Taylor & Dalgleish (2000) 
Expert Opin Investig Drugs 9:471-480; Apostolopoulos & Plebanski (2000) Curr Opin Mol Ther 
2:441-447; Ilan (1999) Curr Opin Mol Ther 1:116-120; Dubensky et al. (2000) Mol Med 6:723-732; 
Robinson & Pertmer (2000) Adv Virus Res 55:1-74; Donnelly et al. (2000) Am J Respir Crit Care 
Med 162(4 Pt 2):S190-193; and Davis (1999) Mt. Sinai J. Med. 66:84-90}. Protein components of the 
compositions of the invention may thus be replaced by nucleic acid (preferably DNA e.g. in the form 
of a plasmid) that encodes the protein. 

The term "comprising" means "including" as well as "consisting" e.g. a composition 
"comprising" X may consist exclusively of X or may include something additional e.g. X + Y. 

The term "about" in relation to a numerical value x means, for example, x±10%. 

References to a percentage sequence identity between two amino acid sequences mean that, 
when aligned, that percentage of amino acids are the same in comparing the two sequences. This 
alignment and the percent homology or sequence identity can be determined using software programs 
known in the art, for example those described in section 7.7.18 of Current Protocols in Molecular 
Biology (F.M. Ausubel et al., eds., 1987) Supplement 30. A preferred alignment is determined by the 
Smith-Waterman homology search algorithm using an affine gap search with a gap open penalty of 12 
and a gap extension penalty of 2, BLOSUM matrix of 62. The Smith-Waterman homology search 
algorithm is disclosed in Smith & Waterman (1981) Adv. Appl. Math. 2: 482-489. 
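
The definition above can be illustrated with a short Python sketch. It is not the software referred to in the text and makes several assumptions: a flat match/mismatch score (`match=5`, `mismatch=-4`) stands in for the BLOSUM62 matrix so that the example stays self-contained; a gap of length k is charged `gap_open + (k - 1) * gap_extend`; and percent identity is counted over aligned columns in which neither sequence has a gap. The function names are hypothetical.

```python
NEG_INF = float("-inf")


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of gap-free aligned columns in which both residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if a != gap and b != gap]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def smith_waterman_score(seq_a: str, seq_b: str, match: int = 5,
                         mismatch: int = -4, gap_open: int = 12,
                         gap_extend: int = 2) -> int:
    """Best local alignment score with affine gaps (Gotoh-style recurrences).

    The gap penalties default to the values named in the text (12 and 2);
    the match/mismatch scores are placeholders for a substitution matrix.
    """
    width = len(seq_b) + 1
    prev_h = [0] * width          # best score ending at (i-1, j)
    prev_f = [NEG_INF] * width    # best score ending in a gap in seq_b
    best = 0
    for i in range(1, len(seq_a) + 1):
        cur_h = [0] * width
        cur_f = [NEG_INF] * width
        e = NEG_INF               # best score ending in a gap in seq_a
        for j in range(1, width):
            e = max(cur_h[j - 1] - gap_open, e - gap_extend)
            cur_f[j] = max(prev_h[j] - gap_open, prev_f[j] - gap_extend)
            s = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            # Local alignment: scores are floored at zero.
            cur_h[j] = max(0, prev_h[j - 1] + s, e, cur_f[j])
            best = max(best, cur_h[j])
        prev_h, prev_f = cur_h, cur_f
    return best


if __name__ == "__main__":
    print(smith_waterman_score("ACDEFGHIKL", "ACDEFHIKL"))
    print(round(percent_identity("ACDEF-GH", "ACDQFAGH"), 2))
```

In practice an established alignment tool with a real substitution matrix would be used; the sketch only makes the roles of the gap open and gap extension penalties concrete.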

The invention is further illustrated, without limitation, by the following examples. 

EXAMPLE 1: Binding of an Adhesin Island surface protein, GBS 80, to Fibrinogen and 
Fibronectin. 

This example demonstrates that an Adhesin Island surface protein, GBS 80, can bind to 
fibrinogen and fibronectin. 

An enzyme-linked immunosorbent assay (ELISA) was used to analyse the in vitro binding 
of recombinant GBS 80 to immobilized extra-cellular matrix (ECM) proteins; bovine 
serum albumin (BSA) served as a negative control. Microtiter plates were coated with ECM proteins 
(fibrinogen, fibronectin, laminin, collagen type IV) and binding was assessed by adding varying 
concentrations of a recombinant form of GBS 80, over-expressed and purified from E. coli (FIGURE 5A). 
Plates were then incubated sequentially with a) mouse anti-GBS 80 primary antibody; b) rabbit 
anti-mouse AP-conjugated secondary antibody; c) pNPP colorimetric substrate. Relative binding was 
measured by monitoring absorbance at 405 nm, using 595 nm as a reference wavelength. FIGURE 5B 
shows binding of recombinant GBS 80 to immobilized ECM proteins (1 µg) as a function of 
concentration of GBS 80. BSA was used as a negative control. Data points represent the means of 
OD405 values ± standard deviation for 3 wells. 
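
The way each data point in such a binding curve is summarized can be sketched in Python as follows. This is only an illustration under stated assumptions: the reference wavelength is applied by subtracting the 595 nm reading from the 405 nm reading well by well, the standard deviation is the sample (n - 1) form, and the function name and absorbance values are invented for the example.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def summarize_wells(od405: Sequence[float],
                    od595: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, sample SD) of reference-corrected absorbance.

    Assumption: correction is a per-well subtraction, OD405 - OD595.
    """
    if len(od405) != len(od595):
        raise ValueError("each well needs both a 405 nm and a 595 nm reading")
    if len(od405) < 2:
        raise ValueError("at least two wells are needed for a standard deviation")
    corrected = [a - r for a, r in zip(od405, od595)]
    return mean(corrected), stdev(corrected)


if __name__ == "__main__":
    # Hypothetical triplicate wells for one GBS 80 concentration.
    avg, sd = summarize_wells([0.5, 0.6, 0.7], [0.1, 0.1, 0.1])
    print(f"OD405 (corrected) = {avg:.3f} ± {sd:.3f}")
```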

Binding of GBS 80 to the tested ECM proteins was found to be concentration dependent and 
exhibited saturation kinetics. As is also evident from FIGURE 5, binding of GBS 80 to fibronectin 
and fibrinogen was greater than binding to laminin and collagen type IV at all the concentrations 
tested. 

EXAMPLE 2: GBS 80 is required for surface localization of GBS 104. 

This example demonstrates that co-expression of GBS 80 is required for surface localization 
of GBS 104. 

The polycistronic nature of the Adhesin Island I mRNA was investigated through reverse 
transcriptase-PCR (RT-PCR) analysis employing primers designed to detect transcripts arising from 
contiguous genes. Total RNA was isolated from GBS cultures grown to an optical density at 600 nm 
(OD600) [...] by the RNeasy Total RNA isolation method (Qiagen) 
according to the manufacturer's instructions. The absence of contaminating chromosomal DNA was 
confirmed by failure of the gene amplification reactions to generate a product detectable by agarose 
gel electrophoresis in the absence of reverse transcriptase. RT-PCR analysis was performed with the 
Access RT-PCR system (Promega) according to the manufacturer's instructions, employing PCR 
cycling temperatures of 60°C for annealing and 70°C for extension. Amplification products were 
visualized alongside 100-bp DNA markers in 2% agarose gels after ethidium bromide staining. 

FIGURE 5 shows that all the genes are co-transcribed as an operon. A schematic of the AI-1 
operon is shown above the agarose gel analysis of the RT-PCR products. Large rectangular arrows 
indicate the predicted transcript direction. Primer pairs were selected such that "1-4" cross the 3' finish-
5' start of successive genes and overlap each gene by at least 200 bp. Additionally, "1" crosses a 
putative rho-independent transcriptional terminator. "5" is an internal GBS 80 control and "6" is an 
unrelated control from a highly expressed gene. Lanes: "a": RNA plus RTase enzyme; "b": RNA 
without RTase; "c": genomic DNA control. 

In an effort to elucidate the functions of the AI-1 proteins, in-frame deletions of all of the 
genes within the operon have been constructed and the resulting mutants characterized with respect to 
surface exposure of the encoded antigens (see FIGURE 8). 

Each in-frame deletion mutation was constructed by splice overlap extension PCR (SOE-PCR) 
essentially as described by Horton et al. [Horton R. M., Z. L. Cai, S. N. Ho, L. R. Pease (1990) 
Biotechniques 8:528-35] using suitable primers and cloned into the temperature sensitive shuttle 
vector pJRS233 to replace the wild type copy by allelic exchange [Perez-Casal, J., J. A. Price, et al. 
(1993) Mol Microbiol 8(5): 809-19]. All plasmid constructions utilized standard molecular biology 
techniques, and the identities of DNA fragments generated by PCR were verified by sequencing. 
Following SOE-PCR, the resulting mutant DNA fragments were digested with XhoI and EcoRI, and 
ligated into a similarly digested pJRS233. The resulting vectors were introduced by electroporation 
into the chromosome of 2603 and COH1 GBS strains in a three-step process, essentially as described 
in Framson et al. [Framson, P. E., A. Nittayajarn, J. Merry, P. Youngman, and C. E. Rubens. (1997) 
Appl. Environ. Microbiol. 63(9):3539-47]. Briefly, the vector pJRS233 contains an erm gene 
encoding erythromycin resistance and a temperature-sensitive gram-positive replicon that is active at 
30°C but not at 37°C. Initially, the constructs are electroporated into GBS electro-competent cells 
prepared as described by Framson et al., and transformants containing free plasmid are selected by 
their ability to grow at 30°C on Todd-Hewitt Broth (THB) agar plates containing 1 µg/ml 
erythromycin. The second step includes a selection step for strains in which the plasmid has integrated 
into the chromosome via a single recombination event over the homologous plasmid insert and 
chromosome sequence by their ability to grow at 37°C on THB agar medium containing 1 mg/ml 
erythromycin. In the third step, GBS cells containing the plasmid integrated within the chromosome 
(integrants) are serially passed in broth culture in the absence of antibiotics at 30°C. Plasmid excision 
[...] either completed the allelic exchange or reconstituted the wild-type genotype. Subsequent loss of the 
plasmid in the absence of antibiotic selection pressure resulted in an erythromycin-sensitive 
phenotype. In order to assess gene replacement, erythromycin-sensitive colonies were screened 
by analysis of the target gene PCR amplicons. 

FIGURE 7 reports a schematic of the AI-1 operon for each knock-out strain generated, along 
with the deletion position within the amino acid sequence. Most data presented here concern the 
COH1 deletion strains, in which the expression of each of the antigens is higher by DNA microarray 
analysis (data not shown) as well as detectable by FACS analysis (see FIGURE 8). The 2603 Δ80, Δ104 
double mutant was constructed by sequential allelic exchanges of the alleles shown. 

Immunization protocol 

Immune sera for FACS experiments were obtained as follows. 

Groups of 4 CD-1 outbred female mice 6-7 weeks old (Charles River Laboratories, Calco 
Italy) were immunized with the selected GBS antigens (20 µg of each recombinant GBS antigen), 
suspended in 100 µl of PBS. Each group received 3 doses at days 0, 21 and 35. Immunization was 
performed through intra-peritoneal injection of the protein with an equal volume of Complete 
Freund's Adjuvant (CFA) for the first dose and Incomplete Freund's Adjuvant (IFA) for the following 
two doses. In each immunization scheme negative and positive control groups were used. Immune 
response was monitored by using serum samples taken on day 0 and 49. 

FACS analysis 

Preparation of paraformaldehyde treated GBS cells and their FACS analysis were carried out 
as follows. 

GBS serotype COH1 strain cells were grown in Todd Hewitt Broth (THB; Difco 
Laboratories, Detroit, Mich.) to OD600nm = 0.5. The culture was centrifuged for 20 minutes at 5000 
rpm and bacteria were washed once with PBS, resuspended in PBS containing 0.05% 
paraformaldehyde, and incubated for 1 hour at 37°C and then overnight at 4°C. 50 µl of fixed 
bacteria (OD600 0.1) were washed once with PBS, resuspended in 20 µl of Newborn Calf Serum 
(Sigma) and incubated for 20 min at room temperature. The cells were then incubated for 1 hour at 
4°C in 100 µl of preimmune or immune sera, diluted 1:200 in dilution buffer (PBS, 20% Newborn 
Calf Serum, 0.1% BSA). After centrifugation and washing with 200 µl of washing buffer (0.1% BSA 
in PBS), samples were incubated for 1 hour at 4°C with 50 µl of R-Phycoerythrin conjugated F(ab)2 
goat anti-mouse IgG (Jackson ImmunoResearch Laboratories, Inc.), diluted 1:100 in dilution buffer. 
Cells were washed with 200 µl of washing buffer and resuspended in 200 µl of PBS. Samples were 
analysed using a FACS Calibur apparatus (Becton Dickinson, Mountain View, Calif.) and data were 
analyzed using the Cell Quest Software (Becton Dickinson). A shift in mean fluorescence intensity of 
> 75 channels compared to preimmune sera from the same mice was considered positive. This cutoff 
was based on the standard deviations of shifts obtained with control sera raised 
against mock purified recombinant proteins from cultures of E. coli carrying the empty expression 
vector and included in every experiment. Artifacts due to bacterial lysis were excluded using antisera 
raised against 6 different known cytoplasmic proteins, all of which were negative. 

FACS data on COH1 single KO mutants for GBS 104 and GBS 80 indicated that GBS 80 is 
required for surface localization of GBS 104. 

As shown in FIGURE 8, GBS 104 is not surface exposed in the Δ80 strain (second column, 
bottom), but is present in the whole protein extracts (see FIGURE 10). Mean shift values suggest that 
GBS 104 is partially responsible for GBS 80 surface exposure (mean shift of GBS 80 is reduced to 
~60% of wild-type levels in Δ104), and that GBS 80 is over-expressed in the complemented strain (mean 
shift value ~200% of wild-type level). The Δ80/pGBS 80 strain contains the GBS 80 ORF cloned in the 
shuttle vector pAM401 (Wirth, R., F. Y. An, et al. (1986). J Bacteriol 165(3): 831-6). The vector 
alone does not alter the secretion pattern of GBS 104 (right column). FACS was performed on 
mid-log fixed bacteria with mouse polyclonal antibodies as indicated at left. Black peak is pre-immune 
sera; colored peaks are sera from immunized animals. 
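
The positivity cutoff and the percent-of-wild-type comparison described above can be sketched in a few lines of Python. This is a minimal illustration only, not the procedure carried out in the Cell Quest Software: the function names and the example channel values are hypothetical, and only the 75-channel cutoff is taken from the text.

```python
# Minimal sketch of the FACS read-out described in the text.
# Assumption: mean fluorescence intensities are given in channel units.
# Function names and example values are hypothetical, not from the source.

CUTOFF_CHANNELS = 75.0  # cutoff stated in the text


def mean_shift(immune_mfi, preimmune_mfi):
    """Shift in mean fluorescence intensity versus preimmune serum."""
    return immune_mfi - preimmune_mfi


def is_positive(immune_mfi, preimmune_mfi, cutoff=CUTOFF_CHANNELS):
    """A serum is called positive when the shift exceeds the cutoff."""
    return mean_shift(immune_mfi, preimmune_mfi) > cutoff


def percent_of_wild_type(mutant_shift, wild_type_shift):
    """Express a mutant's mean shift as a percentage of the wild-type shift."""
    return 100.0 * mutant_shift / wild_type_shift


if __name__ == "__main__":
    # Made-up channel values, for illustration only.
    wt_shift = mean_shift(immune_mfi=400.0, preimmune_mfi=200.0)
    mutant_shift = mean_shift(immune_mfi=320.0, preimmune_mfi=200.0)
    print(is_positive(400.0, 200.0))
    print(percent_of_wild_type(mutant_shift, wt_shift))
```

Expressing each mutant's shift relative to the wild-type shift is what allows statements such as "reduced to ~60% of wild-type levels" to be compared across strains.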

EXAMPLE 3: Deletion of GBS 80 causes attenuation in vivo. 

This example demonstrates that deletion of GBS 80 causes attenuation in vivo, suggesting that 
this protein contributes to bacterial virulence. 

Using a mouse model, we studied the role of GBS 80 and GBS 104 in the virulence 
of S. agalactiae. 

Groups of ten outbred female mice 5-6 weeks old (Charles River Laboratories, Calco, 
Italy) were inoculated intraperitoneally with different dilutions of the mutant strains, and LD50 (lethal 
dose 50) values were calculated according to the method of Reed and Muench [Reed, L. J. and H. Muench 
(1938). The American Journal of Hygiene 27(3): 493-7]. As presented in the table below, the LD50, 
in colony forming units (cfu), for both the Δ80 and the Δ80, Δ104 double mutants is about 10 
fold higher than for the wild type strain, suggesting that inactivation of GBS 80 but not GBS 
104 is responsible for an attenuation in virulence. This finding indicates that the GBS 80 gene in AI-1 
might contribute to virulence. 

Table. Lethal dose 50% analysis of AI-1 mutants in the 2603 strain background. LD50s were 
performed by IP injection of female CD1 mice at an age of 5-6 weeks. LD50s were calculated by the 
method of Reed and Muench (8). 

GBS strain                   LD50, cfu 
Wild Type 2603               2 x 10^[illegible] 
Δ104 mutant                  [illegible] x 10^8 
Δ80 mutant                   2.6 x 10^9 
Δ80, Δ104 double mutant      [illegible] x 10^9 

Number of Experiments (surviving entries; row assignment unclear): 4; 1 
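
The Reed and Muench calculation cited above can be sketched as follows. This is a minimal illustration of the published cumulative method, not the calculation actually performed for the table: the function name is hypothetical and the dose and death counts in the usage example are invented for demonstration, not data from these experiments.

```python
import math

# Minimal sketch of the Reed and Muench (1938) LD50 estimate.
# Assumptions: one group of animals per dose, deaths counted per group.
# The function name and the example counts are hypothetical.


def reed_muench_ld50(doses, deaths, group_sizes):
    """Estimate the LD50 by the Reed-Muench cumulative method."""
    order = sorted(range(len(doses)), key=lambda i: doses[i])
    doses = [doses[i] for i in order]
    dead = [deaths[i] for i in order]
    alive = [group_sizes[i] - deaths[i] for i in order]
    n = len(doses)

    # An animal killed by a low dose is assumed to die at any higher dose,
    # so deaths accumulate from the lowest dose upwards.
    cum_dead = [sum(dead[: i + 1]) for i in range(n)]
    # An animal surviving a high dose is assumed to survive any lower dose,
    # so survivors accumulate from the highest dose downwards.
    cum_alive = [sum(alive[i:]) for i in range(n)]

    mortality = [100.0 * d / (d + a) for d, a in zip(cum_dead, cum_alive)]

    for i in range(n - 1):
        below, above = mortality[i], mortality[i + 1]
        if below < 50.0 <= above:
            # Proportionate distance between the two bracketing doses.
            distance = (50.0 - below) / (above - below)
            log_low = math.log10(doses[i])
            log_high = math.log10(doses[i + 1])
            return 10 ** (log_low + distance * (log_high - log_low))

    raise ValueError("50% mortality is not bracketed by the tested doses")


if __name__ == "__main__":
    # Invented example: ten-fold dilutions, ten mice per group.
    ld50 = reed_muench_ld50(
        doses=[1e7, 1e8, 1e9, 1e10],
        deaths=[0, 2, 8, 10],
        group_sizes=[10, 10, 10, 10],
    )
    print(f"LD50 ~ {ld50:.2e} cfu")
```

Interpolating on the log10 dose scale matches the ten-fold dilution series normally used for such challenges; a ten-fold increase in LD50, as reported for the Δ80 strains, corresponds to a shift of one log unit.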



EXAMPLE 4: Effect of Adhesin Island Sortase Deletions on Surface Antigen Presentation 

This example illustrates the effect of adhesin island sortase deletions on surface antigen 
presentation. 

FACS analysis results set forth in FIGURE 9 show that a deletion in sortase SAG0648 
prevented GBS 104 from reaching the surface and slightly reduced the surface exposure of GBS 80 
(fourth panel; mean shift value ~60% of wild-type COH1). In the double sortase knock-out strain, 
neither antigen was surface exposed (far right panel). Either sortase alone was sufficient for GBS 80 
to arrive at the bacterial surface (third and fourth columns, top). No effect was seen on surface 
exposure of antigens GBS 80 or GBS 104 in the ΔGBS 52 strain. Antibodies derived from purified 
GBS 52 were either non-specific or were FACS negative for GBS 52 (data not shown). FACS 
analysis was performed as described above (see EXAMPLE 2). 

As shown in FIGURE 10, inactivation of GBS 80 has no effect on GBS 104 expression, just 
as the GBS 104 knock-out does not change the total amount of GBS 80 expressed. The Western blot of 
whole protein extracts (strains noted above lanes) probed with anti-GBS 80 antisera is shown in panel 
A. Arrow indicates expected size of GBS 80 (60 kDa). GBS 80 antibodies recognize a doublet; the 
lower band is not present in ΔGBS 80 strains. Panel B shows a Western blot of whole protein extracts 
probed with anti-GBS 104 antisera. Arrow indicates expected size of GBS 104 (99.4 kDa). Protein 
extracts were prepared from the same bacterial cultures used for FACS (FIGURES 8 and 9). In 
conclusion, although GBS 104 does not arrive at the surface in the Δ80 strain by FACS (FIGURE 8, 
second column), it is present at approximately wild-type levels in the whole protein preps (B, second 
lane). Approximately 20 µg of each protein extract was loaded per lane. 

Western-blot analysis 

Aliquots of total protein extract mixed with SDS loading buffer (1x: 60 mM TRIS-HCl pH 
6.8, 5% w/v SDS, 10% v/v glycerin, 0.1% Bromophenol Blue, 100 mM DTT) and boiled for 5 minutes at 
95°C were loaded on a 12.5% SDS-PAGE precast gel (Biorad). The gel is run using a SDS-PAGE 
running buffer containing 250 mM TRIS, 2.5 mM Glycine and 0.1% SDS. The gel is electroblotted 
onto nitrocellulose membrane at 200 mA for 60 minutes. The membrane is blocked for 60 minutes 
with PBS/0.05% Tween-20 (Sigma), 10% skimmed milk powder and incubated O/N at 4°C with 
PBS/0.05% Tween 20, 1% skimmed milk powder, with the appropriate dilution of the sera. After 
washing twice with PBS/0.05% Tween, the membrane is incubated for 2 hours with 
peroxidase-conjugated secondary anti-mouse antibody (Amersham) diluted 1:4000. The nitrocellulose is washed 
three times for 10 minutes with PBS/0.05% Tween and once with PBS and thereafter developed by 
Opti-4CN Substrate Kit (Biorad). 

Example 5: Binding of Adhesin Island proteins to epithelial cells and effect of Adhesin Island 
proteins on capacity of GBS to adhere to epithelial cells. 

This example illustrates the binding of AI proteins to epithelial cells and the effect of AI 
proteins on the capacity of GBS to adhere to epithelial cells. 

Applicants analysed whether recombinant AI surface proteins GBS 80 or GBS 104 would 
demonstrate binding to various epithelial cells in a FACS analysis. Applicants also analysed whether 
deletion of AI surface proteins affects the capacity of GBS to adhere to and invade epithelial cells. 

As shown in Figure 28, deletion of GBS 80 sequence from GBS strain isolate 2603 (serotype 
V) did not affect the capacity of the mutated GBS to adhere to and invade ME180 cervical epithelial 
cells. Here ME180 cervical carcinoma epithelial cells were infected with wild type GBS 2603 or 
GBS 2603 Δ80 isogenic mutant. After two hours of infection, non-adherent bacteria were washed off 
and infection prolonged for a further two hours and four hours. In invasion experiments, each 
time point was followed by a two hour antibiotic treatment. Cells were then lysed with 1% saponin 
and lysates plated on TSA plates. As shown in Figure 28, there was little difference between the 
percent invasion or percent adhesion of wild type and mutant strains up to the four hour time point. 

Figure 30 repeats this experiment with both Δ104 and Δ80 mutants from a different strain 
isolate. Here, ME180 cervical carcinoma epithelial cells were infected with GBS strain isolate COH1 
(serotype III) wild type or COH1 ΔGBS 104 or COH1 Δ80 isogenic mutant. After one hour of 
infection, non-adherent bacteria were washed off and the cells were lysed with 1% saponin. The 
lysates were plated on TSA plates. As shown in Figure 30, while there was little difference in the 
percent invasion, there was a significant decrease in the percent association of the Δ104 mutant 
compared to both the wild type and Δ80 mutant. 

The effect of AI surface proteins on the ability of GBS to translocate through an epithelial 
monolayer was also analysed. As shown in Figure 31, a GBS 80 knockout mutant strain partially 
loses the ability to translocate through an epithelial monolayer. Here epithelial monolayers were 
inoculated with wildtype or knockout mutant in the apical chamber of a transwell system for two 
hours and then non-adherent bacteria were washed off. Infection was prolonged for a further two and 
four hours. Samples were taken from the media of the basolateral side and the number of colony 
forming units measured. Transepithelial electrical resistance measured prior to and after infection 
gave comparable values, indicating the maintenance of the integrity of the monolayer. By the six 
hour time point, the Δ80 mutants demonstrated a reduced percent transcytosis. 

A similar experiment was conducted with GBS 104 knock out mutants. Here, as shown in 
Figure 22, the Δ104 mutants also demonstrated a reduced percent transcytosis, indicating that the 
mutant strains translocate through an epithelial monolayer less efficiently than their isogenic wild type 
counterparts. 

Applicants also studied the effect of AI proteins on the capacity of a GBS strain to invade 
J774 macrophage-like cells. Here, J774 cells were infected with GBS COH1 wild type or COH1 
ΔGBS104 or COH1 ΔGBS80 isogenic mutants. After one hour of infection, non-adherent bacteria 
were washed off and intracellular bacteria were recovered at two, four and six hours post antibiotic 
treatment. At each time point, cells were lysed with 0.25% Triton X-100 and lysates plated on TSA 
plates. As shown in Figure 32, the Δ104 mutant demonstrated a significantly reduced percent 
invasion compared to both the wild type and Δ80 mutant. 

This example illustrates hyperoligomeric structures comprising AI surface proteins GBS 80 
and GBS 104. A GBS isolate COH1 (serotype III) was adapted to increase expression of GBS 80. 
Figure 34 presents a regular negative stain electron micrograph of this mutant; no pilus or 
hyperoligomeric structures are distinguishable on the surface of the bacteria. When the EM stain is 
based on anti-GBS 80 antibodies labelled with 10 or 20 nm gold particles, the presence of GBS 80 
throughout the hyperoligomeric structure is clearly indicated (Figures 36, 37 and 38). EM staining 
against GBS 104 (anti-GBS 104 antibodies labelled with 10 nm gold particles) also reveals the 
presence of GBS 104 primarily on or near the surface of the bacteria or potentially associated with 
bacterial peptidoglycans (Figure 39). Analysis of this same strain (over-expressing GBS 80) with a 
combination of both anti-GBS 80 (using 20 nm gold particles) and anti-GBS 104 (using 10 nm gold 
particles) reveals the presence of GBS 104 on the surface and within the hyperoligomeric structures 
(see Figures 40 and 41). 

Example 7: GBS 80 is necessary for polymer formation and GBS 104 and sortase SAG0648 are 
necessary for efficient pili assembly 

This example demonstrates that GBS 80 is necessary for formation of polymers and that GBS 
104 and sortase SAG0648 are necessary for efficient pili assembly. GBS 80 and GBS 104 polymeric 
assembly was systematically analyzed in Coh1 strain single knock out mutants of each of the relevant 
coding genes in AI-1 (GBS 80, GBS 104, GBS 52, sag0647, and sag0648). Figure 41 provides 
Western blots of total protein extracts (strains noted above lanes) probed with either anti-GBS 80 (left 
panel) sera or anti-GBS 104 sera (right panel) for each of these Coh1 and Coh1 knock out strains. 
(Coh1, wild type Coh1; Δ80, Coh1 with GBS 80 knocked out; Δ104, Coh1 with GBS 104 knocked 
out; Δ52, Coh1 with GBS 52 knocked out; Δ647, Coh1 with SAG0647 knocked out; Δ648, Coh1 with 
SAG0648 knocked out; Δ647-8, Coh1 with SAG0647 and SAG0648 knocked out; Δ80/pGBS80, 
Coh1 with GBS 80 knocked out but complemented with a high copy number plasmid expressing GBS 
80. Asterisks identify the monomer of GBS 80 and GBS 104.) 

The smear of immunoreactive material observed in the wild type strain, along with its 
disappearance in Δ80 and Δ104 mutants, is consistent with the notion that such high molecular weight 
structures are composed of covalently linked (SDS-resistant) GBS 80 and GBS 104 subunits. The 
immunoblotting with both anti-GBS 80 (α-GBS 80) and anti-GBS 104 (α-GBS 104) revealed that 
deletion of sortase SAG0648 also interferes with the assembly of high molecular weight species, 
whereas the knock out mutant of the second sortase (SAG0647), even if somewhat reduced, still 
maintains the ability to form polymeric structures. 

Total extracts from GBS were prepared as follows. Bacteria were grown in 50 ml of 
Todd-Hewitt broth (Difco) to an OD600nm of 0.5-0.6 and subsequently pelleted. After two washes in PBS the 
pellet was resuspended and incubated 3 hours at 37°C with mutanolysin. Cells were then lysed with at 
least three freezing-thawing cycles in dry ice and a 37°C bath. The lysate was then centrifuged to 
eliminate the cellular debris and the supernatant was quantified. Approximately 40 µg of each protein 
extract was separated on SDS-PAGE. The gel was then subjected to immunoblotting with mouse 
antisera and detected with chemiluminescence. 

Example 8: GBS 80 is polymerized by an AI-2 sortase 

This example illustrates that GBS 80 can be polymerized not only by AI-1 sortases, but also 
by AI-2 sortases. Figure 42 shows total cell extract immunoblots of GBS 515 strain, which lacks AI-1. 
The left panel, where anti-GBS 67 sera were used, shows that GBS 67 from AI-2 is assembled 
into high-molecular-weight complexes, suggesting the formation of a second type of pilus. The same 
high molecular weight structure is observed when GBS 80 is highly expressed by reintroducing the gene 
within a plasmid (pGBS 80). By using anti-GBS 80 (right panel) sera on the same extracts, again it is 
observed that, with GBS 80 over expression (515/pGBS 80), a high-molecular weight structure is 
assembled. This implies that, in the absence of AI-1 sortases, AI-2 sortases (SAG1405 and SAG1406) 
can complement the lacking function, still being able to assemble GBS 80 in a pilus structure. 

Example 9: Coh1 produces a high molecular weight molecule, the GBS 80 pilin 

This example illustrates that Coh1 produces a high molecular weight molecule, greater than 
1000 kDa, which is the GBS 80 pilin. Figure 43 provides silver-stained electrophoretic gels that show 
that Coh1 produces two macromolecules. One of these macromolecules disappears in the Coh1 GBS 
80 knock out cells, but does not disappear in the Coh1 GBS 52 knock out mutant cells. The last two 
lanes on the right were loaded with 15 times the amount loaded in the other lanes. This was done in 
order to be able to count the bands. By doing this, a conservative size estimate of the top bands was 
calculated by starting at 240 kDa and considering each of 14 higher bands as the result of consecutive 
additions of a GBS 80 monomer. 
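
The band-counting estimate described above amounts to simple arithmetic, sketched here in Python. This is an illustration under a stated assumption, not the authors' calculation: the text does not say which monomer mass was used, so the 60 kDa value is borrowed from the expected size of GBS 80 given in the Western blot description of Example 4, and the function name is hypothetical.

```python
# Minimal sketch of the band-ladder size estimate.
# From the text: counting starts at 240 kDa and 14 higher bands are counted.
# Assumption (not stated for this calculation): each step adds one GBS 80
# monomer of about 60 kDa, the expected size quoted elsewhere in the text.

BASE_BAND_KDA = 240
MONOMER_KDA = 60  # assumed monomer mass
HIGHER_BANDS = 14


def ladder_sizes(base_kda, monomer_kda, n_additions):
    """Sizes of the base band and of each band formed by adding one monomer."""
    return [base_kda + k * monomer_kda for k in range(n_additions + 1)]


if __name__ == "__main__":
    sizes = ladder_sizes(BASE_BAND_KDA, MONOMER_KDA, HIGHER_BANDS)
    print(sizes)
    print(f"Top counted band: about {sizes[-1]} kDa")
```

Under this assumption the top counted band comes out slightly above 1000 kDa, which agrees with the "greater than 1000 kDa" statement at the start of this example.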

Coh1, wild type Coh1; Δ80, Coh1 cells with GBS 80 knocked out; Δ52, Coh1 cells with GBS 
52 knocked out; Δ80/pGBS 80, Coh1 cells with GBS 80 knocked out and complemented with a high 
copy number construct expressing GBS 80. 

Example 10. GBS 52 is a minor component of the GBS pilus 

This example illustrates that GBS 52 is present in the GBS pilus and is a minor component of 
the pilus. Figure 45 shows an immunoblot of total cell extracts from a GBS Coh1 strain and a GBS 
Coh1 strain knocked out for GBS 52 (Δ52). The total cell extracts were immunoblotted with anti-GBS 80 
antisera (left) and anti-GBS 52 antisera (right). Immunoblotting was performed using a 3-8% 
Tris-acetate polyacrylamide gel (Invitrogen) which provided excellent separation of large molecular 
weight proteins (see figure 41). When the gel was incubated with anti-GBS 80 sera, the bands from 
the Coh1 wild-type strain appeared shifted when compared to the Δ52 mutant. This observation 
is consistent with GBS 52 being part of these structures. When the membrane 
was stripped and incubated with anti-GBS 52 sera, the high-molecular-weight species in the Coh1 wild-type 
strain showed a molecular size similar to those in the corresponding lane in the left panel. These 
findings confirmed that GBS 52 is indeed associated with GBS 80 macro-molecular structures but 
represents a minor component of the GBS pilus. 



Example 11: Pilus structures are present in the supernatant of GBS bacterial cultures 

This example illustrates that the pilus structure assembled in Coh1 GBS is present in the 
supernatant of a bacterial cell culture. Figure 46 shows an immunoblot of the protein extract of 
the supernatant from cultures of different GBS mutant strains (117 = Coh1 GBS 80 knockout; 159 = 
Coh1 GBS 104 knockout; 202 = Coh1 GBS 52 knockout; 206 = Coh1 GBS sag0647 knockout; 208 = 
Coh1 GBS sag0648 knockout; 197 = Coh1 GBS sag0647/sag0648 knockout; 179 = Coh1 GBS 80 
knockout complemented with a high copy plasmid expressing GBS 80). GBS 80 antisera detect the 
presence of pilus structures in the appropriate Coh1 strains. 

The protein extract was prepared as follows. Bacteria were grown in THB to an OD600nm of 
0.5-0.6 and the supernatant was separated from the cells by centrifugation. The supernatant was then 
filtered (Ø 0.2 µm) and 60% TCA was added to 1 ml for protein precipitation. 

GBS pili were also extracted from the fraction of surface-exposed proteins in Coh1 strain and its GBS 
80 knock out mutant as described hereafter. Bacteria were grown to an OD600nm of 0.6 in 50 ml of 
THB at 37°C. Cells were washed once with PBS and the pellet was then resuspended in 0.1 M KPO4 
pH 6.2, 40% sucrose, 10 mM MgCl2, 400 U/ml mutanolysin and incubated 3 hours at 37°C. 
Protoplasts were separated by centrifugation and the supernatant was recovered and its protein content 
measured. 

In order to study the dynamics of pilus production during different growth phases, 1 ml 
supernatant of a culture at different OD600nm was TCA precipitated and loaded onto a 3-8% 
SDS-PAGE as described before. Figure 47 shows the corresponding Western blot with GBS 80 anti-sera. 
The first group of lanes (left five sample lanes) refers to Coh1 strain growth (OD600nm values are noted 
above the lanes) whereas the second group of lanes (right five samples) is from a GBS 80 knock out 
strain over expressing GBS 80. The experiment shows that pilus macromolecular structures can be 
found in the supernatant in all of the growth phases tested. 

Example 12: In GBS strain Coh1, only GBS 80 and a sortase (sag0647 or sag0648) are required 
for polymerization 

This example describes requirements for pilus formation in Coh1. Figure 48 shows a Western 
blot of total protein extracts (prepared as described before) using anti-GBS 80 sera on Coh1 clones. 
(Coh1, wild type Coh1; Δ104, Coh1 knocked out for GBS 104; Δ647, Coh1 knocked out for sag0647; 
Δ648, Coh1 knocked out for sag0648; Δ647-8, Coh1 knocked out for sag0647 and sag0648; 515, wild 
type 515; [...] 80.) The data show that only the double sortase mutant is unable to polymerize GBS 80, indicating that 
the 'conditio sine qua non' for pilus polymerization is the co-existence of GBS 80 with at least one 
sortase. This result leads to a reasonable assumption that SAG1405 and SAG1406 are responsible for 
polymerization in this strain. 

Example 13: GBS 80 can be expressed in L. lactis under its own promoter and terminator 
sequences 

This example demonstrates that L. lactis, a non-pathogenic bacterium, can express GBS AI 
polypeptides such as GBS 80. L. lactis MG1363 (J. Bacteriol. 154 (1983): 1-9) was transformed with a 
construct encoding GBS 80. Briefly, the construct was prepared by cloning a DNA fragment 
containing the gene coding for GBS 80 under its own promoter and terminator sequences into plasmid 
pAM401 (a shuttle vector for E. coli and other Gram positive bacteria; J. Bacteriol. 165 (1986): 
831-836). Total extracts of the transformed bacteria in log phase were separated on SDS-PAGE, 
transferred to membranes, and incubated with antiserum against GBS 80. A polypeptide 
corresponding to the molecular weight of GBS 80 was detected in the lanes containing total extracts 
of L. lactis transformed with the GBS 80 construct. See Figures 133A and 133B, lanes 6 and 7. This 
same polypeptide was not detected in the lane containing total extracts of L. lactis not transformed 
with the GBS 80 construct, lane 9. This example shows that L. lactis can express GBS 80 under its 
own promoter and terminator. 

Example 14: L. lactis modified to express GBS AI-1 under the GBS 80 promoter and 
terminator sequences expresses GBS 80 in polymeric structures 

This example demonstrates the ability of L. lactis to express GBS AI-1 polypeptides and to 
incorporate at least some of the polypeptides into oligomers. L. lactis was transformed with a 
construct containing the genes encoding GBS AI-1 polypeptides. Briefly, the construct was prepared 
by cloning a DNA fragment containing the genes for GBS 80, GBS 52, SAG0647, SAG0648, and 
GBS 104 under the GBS 80 promoter and terminator sequences into construct pAM401. The 
construct was transformed into L. lactis MG1363. Total extracts of log phase transformed bacteria were 
separated on reducing SDS-PAGE, transferred to membranes, and incubated with antiserum against 
GBS 80. A polypeptide with a molecular weight corresponding to the molecular weight of GBS 80 
was detected in the lanes containing L. lactis transformed with the GBS AI-1 encoding construct. See 
Figure 134, lane 2. In addition, the same lane also showed immunoreactivity of polypeptides having 
higher molecular weights than the polypeptide having the molecular weight of GBS 80. These higher 
molecular weight polypeptides are likely oligomers of GBS 80. Oligomers of similar molecular 
weights were detected in a Western blot of the culture supernatant of the transformed L. lactis. 
See lane 4 of Figure 135. Thus, this example shows that L. lactis transformed to express GBS AI-1 
can efficiently polymerize GBS 80 in the form of a pilus. This pilus structure can likely be purified 
from either the cell culture supernatant or cell extracts. 

Example 15: Cloning and Expression of S. pneumoniae Sp0462 

This example describes the production of a clone encoding a Sp0462 polypeptide and 
expression of the clone. To produce a clone encoding Sp0462, the open reading frame encoding 
Sp0462 was amplified using primers that annealed within the full-length Sp0462 open reading frame 
sequence. Figure 150A provides an 893 amino acid sequence of Sp0462. The primers used to produce 
a clone encoding the Sp0462 polypeptide are shown in Figure 150B. These primers annealed to the 
nucleotide sequences encoding the amino acid residues indicated by underlining in Figure 150A. 
Amplification of the open reading frame encoding Sp0462 using these primers produced the amplicon 
shown at lane 2 of the agarose gel provided in Figure 160. The Sp0462 clone encodes amino acid 
residues 38-862 of the 893 amino acid residue Sp0462 protein; the italicized residues in Figure 150A 
were eliminated. Figure 151A provides a schematic depiction of the recombinant Sp0462 
polypeptide. Figure 151B shows a schematic depiction of the full-length Sp0462 polypeptide. Both 
the recombinant Sp0462 encoded by the clone and the full-length Sp0462 protein have two collagen 
binding protein type B (Cna B) domains and a von Willebrand factor A (vWA) domain. The cloned 
recombinant Sp0462 lacks the LPXTG motif present in the full-length Sp0462 protein. Western blot 
analysis for expression of the Sp0462 clone did not result in detection of polypeptides with serum 
obtained from S. pneumoniae-infected patients (Figure 152A) or GBS 80 antiserum (Figure 152B). 

Example 16: Cloning and Expression of S. pneumoniae Sp0463 

This example describes the production of a clone encoding a Sp0463 polypeptide and 
detection of recombinant Sp0463 polypeptide expressed from the clone. To produce a clone encoding 
Sp0463, the open reading frame encoding Sp0463 was amplified using primers that annealed within 
the full-length Sp0463 open reading frame sequence. Figure 153A provides a 665 amino acid 
sequence of Sp0463. The primers used to produce the clone encoding Sp0463 polypeptide are shown 
in Figure 153B. These primers annealed to the nucleotide sequences encoding the amino acid 
residues indicated by underlining in Figure 153A. Amplification of the open reading frame encoding 
Sp0463 using these primers produced the amplicon shown at lane 3 of the agarose gel provided in 
Figure 160. The Sp0463 clone encodes amino acid residues 23-627 of the 665 amino acid residue 
Sp0463 protein; the italicized residues in Figure 153A were eliminated. Figure 154A provides a 
schematic depiction of the recombinant Sp0463 polypeptide. Figure 154B shows a schematic 
depiction of the full-length Sp0463 polypeptide. Both the recombinant Sp0463 encoded by the clone 
and the full-length Sp0463 protein have a Cna B domain and an E box motif. The cloned recombinant 
Sp0463 lacks the LPXTG motif present in the full-length Sp0463 protein. Expression of the Sp0463 
clone resulted in the detection of a 60 kD polypeptide, the expected molecular weight of the 
recombinant Sp0463 polypeptide, by Western blot analysis. See Figure 155. 

Example 17: Cloning and Expression of S. pneumoniae Sp0464 

This example describes the production of a clone encoding a Sp0464 polypeptide and 
detection of recombinant Sp0464 polypeptide expressed from the clone. To produce a clone encoding 
Sp0464, the open reading frame encoding Sp0464 was amplified using primers that annealed 
within the full-length Sp0464 open reading frame sequence. Figure 157A provides a 393 amino acid 
sequence of Sp0464. The primers used to produce a clone encoding the Sp0464 polypeptide are 
shown in Figure 157B. These primers annealed to the nucleotide sequences encoding the amino acid 
residues indicated by underlining in Figure 157A. Amplification of the open reading frame encoding 
Sp0464 using these primers produced the amplicon shown at lane 4 of the agarose gel provided in 
Figure 160. The Sp0464 clone encodes amino acid residues 19-356 of the 393 amino acid residue 
Sp0464 protein; the italicized residues in Figure 157A were eliminated. Figure 158A provides a 
schematic depiction of the recombinant Sp0464 polypeptide. Figure 158B shows a schematic 
depiction of the full-length Sp0464 polypeptide. Both the recombinant Sp0464 encoded by the clone 
and the full-length Sp0464 protein have two Cna B domains. The cloned recombinant Sp0464 lacks 
the LPXTG motif present in the full-length Sp0464 protein. Expression of the Sp0464 clone resulted 
in the detection of a 38 kD polypeptide, the expected molecular weight of the recombinant Sp0464 
polypeptide, by Western blot analysis. See Figure 159. 
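
The residue ranges given in Examples 15 to 17 can be cross-checked against the reported band sizes with a short calculation. The sketch below is a rough sanity check only: the average residue mass of about 110 Da is a general rule of thumb and is an assumption, not a value from this document, the function names are hypothetical, and any affinity tag or fusion partner on the recombinant proteins is ignored.

```python
# Rough sanity check of cloned fragment lengths and approximate masses.
# Residue ranges and full lengths are taken from Examples 15 to 17.
# Assumption: ~110 Da average mass per amino acid residue (rule of thumb).

AVG_RESIDUE_MASS_DA = 110.0

# name: (first cloned residue, last cloned residue, full-length residues)
CLONES = {
    "Sp0462": (38, 862, 893),
    "Sp0463": (23, 627, 665),
    "Sp0464": (19, 356, 393),
}


def fragment_length(first, last):
    """Number of residues in an inclusive residue range."""
    return last - first + 1


def approx_mass_kda(n_residues, residue_mass_da=AVG_RESIDUE_MASS_DA):
    """Approximate polypeptide mass in kDa from its residue count."""
    return n_residues * residue_mass_da / 1000.0


if __name__ == "__main__":
    for name, (first, last, full_length) in CLONES.items():
        n = fragment_length(first, last)
        print(
            f"{name}: residues {first}-{last} of {full_length} "
            f"-> {n} aa, roughly {approx_mass_kda(n):.0f} kDa"
        )
```

With this rule of thumb the Sp0464 fragment (338 residues) comes out at about 37 kDa, close to the 38 kD band reported above, while the Sp0463 fragment (605 residues) comes out somewhat above the reported 60 kD; differences of this size are within what such a coarse estimate can resolve.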

Example 18: Intranasal Immunization of Mice with Recombinant L. lactis Expressing GBS 80 
and Subsequent Challenge 

This example describes a method of intranasally immunizing mice using L. lactis that express 
GBS 80. Intranasal immunization consisted of 3 doses at days 0, 14 and 28, each dose administered 
on three consecutive days. Each day, groups of 3 CD-1 outbred female mice 6-7 weeks old (Charles 
River Laboratories, Calco, Italy) were immunized intranasally with 10^9 or 10^10 CFU of the 
recombinant Lactococcus lactis suspended in 20 µl of PBS. In each immunization scheme negative 
(wild-type L. lactis) and positive (recombinant GBS 80) control groups were used. The immune 
response of the dams was monitored by using serum samples taken on day 0 and 49. The female mice 
were bred 2-7 days after the last immunization (at approximately t = 36-37), and typically had a 
gestation period of 21 days. Within 48 hours of birth, the pups were challenged via I.P. injection with GBS in a 
dose approximately equal to an amount which would be sufficient to kill 90% of immunized pups (as 
determined by empirical data gathered from PBS control groups). The GBS challenge dose is 
preferably administered in 50 µl of THB medium. Preferably, the pup challenge takes place at 56 to 
61 days after the first immunization. The challenge inocula were prepared starting from frozen 
cultures diluted to the appropriate concentration with THB prior to use. Survival of pups was 
monitored for 5 days after challenge. 



Example 19: Subcutaneous Immunization of Mice with Recombinant L. lactis Expressing GBS 
80 and Subsequent Challenge 

This example describes a method of subcutaneously immunizing mice using L. lactis that 
express GBS 80. Subcutaneous immunization consisted of 3 doses at days 0, 14 and 28. Groups of 3 
CD-1 outbred female mice 6-7 weeks old (Charles River Laboratories, Calco, Italy) were injected 
subcutaneously with 10^9 or 10^10 CFU of the recombinant Lactococcus lactis suspended in 100 µl of 
PBS. In each immunization scheme, negative (wild-type L. lactis) and positive (recombinant GBS 80) 
control groups were used. The immune response of the dams was monitored by using serum samples 
taken on day 0 and 49. The female mice were bred 2-7 days after the last immunization (at 
approximately t = 36-37), and typically had a gestation period of 21 days. Within 48 hours of birth, 
the pups were challenged via I.P. injection with GBS in a dose approximately equal to an amount which would 
be sufficient to kill 90% of immunized pups (as determined by empirical data gathered from PBS 
control groups). The GBS challenge dose is preferably administered in 50 µl of THB medium. 
Preferably, the pup challenge takes place at 56 to 61 days after the first immunization. The challenge 
inocula were prepared starting from frozen cultures diluted to the appropriate concentration with THB 
prior to use. Survival of pups was monitored for 5 days after challenge. 

Example 20: Immunization of Mice with GAS AI polypeptides and Subsequent Intranasal 
Challenge 

This example describes a method of immunizing mice with GAS AI polypeptides and 
subsequently intranasally challenging the mice with GAS bacteria. Groups of 10 CD-1 female mice 
aged between 6 and 7 weeks are immunized with a combination of GAS antigens of the invention, 
GAS 15, GAS 16, and GAS 18 (15 µg of each recombinant antigen, derived from M1 strain SF370), 
or L. lactis expressing the M1 strain SF370 adhesin island, suspended in 100 µl of suitable solution. 
Each group receives 3 doses at days 0, 21 and 45. Immunization is performed through subcutaneous 
or intraperitoneal injection for the GAS 15, GAS 16, GAS 18 protein combination. The protein 
combination is administered with an equal volume of Complete Freund's Adjuvant (CFA) for the first 
dose and Incomplete Freund's Adjuvant (IFA) for the following two doses. Immunization is 
performed intranasally for the L. lactis expressing the M1 strain SF370 adhesin island. In each 
immunization scheme negative and positive control groups are used. 

The negative control group for the mice immunized with the GAS 15, GAS 16, GAS 18
protein combination included mice immunized with PBS. The negative control group for the mice
immunized with L. lactis expressing the M1 strain SF370 adhesin island included mice immunized
with L. lactis transformed with the pAM401 expression vector lacking any
cloned adhesin island sequence.

The positive control groups included mice immunized with purified M1 strain SF370 M
protein.

Immunized mice are then anaesthetized with Zoletil and challenged intranasally with a 25 µl
suspension containing 1.2 x 10^6 or 1.2 x 10^8 CFU of ISS 3348 in THB. Animals are observed daily
and checked for survival.
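
The challenge dose fixes both a CFU count and a delivery volume, so the working concentration of the inoculum, and the dilution needed from a titered stock, follow by simple arithmetic. The Python sketch below illustrates that arithmetic only, taking the dose volume as 25 µl. The function names and the stock titer of 1e9 CFU/ml are hypothetical and do not come from this example.

```python
def required_cfu_per_ml(cfu_per_dose: float, dose_volume_ul: float) -> float:
    """Concentration (CFU/ml) needed so that one dose volume carries cfu_per_dose."""
    if dose_volume_ul <= 0:
        raise ValueError("dose volume must be positive")
    # 1 ml = 1000 µl
    return cfu_per_dose * 1000.0 / dose_volume_ul


def dilution_factor(stock_cfu_per_ml: float, target_cfu_per_ml: float) -> float:
    """Fold-dilution of a titered stock (hypothetical titer) to reach the target."""
    if target_cfu_per_ml <= 0:
        raise ValueError("target concentration must be positive")
    if stock_cfu_per_ml < target_cfu_per_ml:
        raise ValueError("stock is less concentrated than the target")
    return stock_cfu_per_ml / target_cfu_per_ml


if __name__ == "__main__":
    # Lower challenge dose of this example: 1.2 x 10^6 CFU in 25 µl
    target = required_cfu_per_ml(1.2e6, 25.0)
    # Hypothetical stock titer, for illustration only
    print(target, dilution_factor(1e9, target))
```

In practice the titer of the thawed culture would be confirmed by plating rather than assumed.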

Example 21: Active Maternal Immunization Assay 

As used herein, an Active Maternal Immunization assay refers to an in vivo protection
assay where female mice are immunized with the test antigen composition. The female mice are
then bred and their pups are challenged with a lethal dose of GBS. Serum titers of the female
mice during the immunization schedule are measured as well as the survival time of the pups
after challenge.

Mouse immunization

Specifically, groups of 4 CD-1 outbred female mice 6-8 weeks old (Charles River
Laboratories, Calco, Italy) are immunized with one or more GBS antigens (20 µg of each
recombinant GBS antigen), suspended in 100 µl of PBS. Each group receives 3 doses at days 0,
21 and 35. Immunization is performed through intra-peritoneal injection of the protein with an
equal volume of Complete Freund's Adjuvant (CFA) for the first dose and Incomplete Freund's
Adjuvant (IFA) for the following two doses. In each immunization scheme negative and positive
control groups are used.

Immune response is monitored by using serum samples taken on day 0 and 49. The sera are 
analyzed as pools from each group of mice. 

Active maternal immunization

A maternal immunization/neonatal pup challenge model of GBS infection was used to
verify the protective efficacy of the antigens in mice. The mouse protection study was adapted
from Rodewald et al., J. Infect. Diseases 166, 635 (1992). In brief, CD-1 female
mice (6-8 weeks old) were immunized before breeding, as described above. The mice received 20
µg of protein per dose when immunized with a single antigen and 60 µg of protein per dose (15
µg of each antigen) when immunized with the combination of antigens. Mice were bred 2-7 days
after the last immunization. Within 48 h of birth, pups were injected intraperitoneally with 50 µl
of GBS culture. Challenge inocula were prepared starting from frozen cultures diluted to the
appropriate concentration with THB before use. In preliminary experiments (not shown), the
challenge doses per pup for each strain tested were determined to cause 90% lethality. Survival of
pups was monitored for 2 days after challenge. Protection was calculated as (percentage
[...] 100. Data were evaluated for statistical significance by Fisher's exact test.
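
The protection calculation and the significance test can be sketched in Python. The protection formula is incomplete in the text above, so the function below ASSUMES the conventional definition (relative reduction in mortality versus the control group). That definition is an assumption, not a statement of the method actually used. The two-sided Fisher's exact test is implemented directly from the hypergeometric distribution. All function names are illustrative.

```python
from math import comb


def protection_percent(dead_control_pct: float, dead_vaccine_pct: float) -> float:
    """ASSUMED definition: relative reduction in pup mortality, in percent."""
    if dead_control_pct <= 0:
        raise ValueError("control mortality must be positive")
    return (dead_control_pct - dead_vaccine_pct) / dead_control_pct * 100.0


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    Rows could be vaccine/control groups and columns dead/surviving pups.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    total = comb(n, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of x in the top-left cell, margins fixed
        return comb(row1, x) * comb(n - row1, col1 - x) / total

    lo, hi = max(0, col1 - (n - row1)), min(row1, col1)
    p_obs = prob(a)
    # Sum every table that is no more likely than the observed one
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, p)
```

For example, `fisher_exact_two_sided(dead_vaccine, alive_vaccine, dead_control, alive_control)` compares pup survival between an immunized group and its control. The small relative tolerance guards against floating-point ties between equally likely tables.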






WO 2006/078318 PCT/US2005/027239 


The invention encompasses, but is not limited to, the embodiments enumerated below. 

1. An immunogenic composition comprising a purified Group B Streptococcus (GBS) 
adhesin island (AI) polypeptide in oligomeric form. 

2. The immunogenic composition of embodiment 1 wherein the GBS AI polypeptide is 
selected from a GBS AI-1. 

3. The immunogenic composition of embodiment 1 wherein the GBS AI polypeptide is 
selected from a GBS AI-2. 

4. The immunogenic composition of any of embodiments 1-3 wherein the GBS AI 
polypeptide comprises a sortase substrate motif. 

5. The immunogenic composition of embodiment 4 wherein the sortase substrate motif is an 
LPXTG motif. 

6. The immunogenic composition of embodiment 5 wherein the LPXTG motif is represented 
by the amino acid sequence XPXTG, wherein the X at amino acid position 1 is an L, an I, or an F and 
the X at amino acid position 3 is any amino acid residue. 

7. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide affects the ability of GBS bacteria to adhere to epithelial cells. 

8. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide affects the ability of GBS bacteria to invade epithelial cells. 

9. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide affects the ability of GBS bacteria to translocate through an epithelial cell layer. 

10. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide is capable of associating with an epithelial cell surface. 

11. The immunogenic composition of embodiment 10 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

12. The immunogenic composition of any of embodiments 1-3 wherein the GBS AI 
polypeptide is a full-length GBS AI protein. 

13. The immunogenic composition of any of embodiments 1-3 wherein the GBS AI 
polypeptide is a fragment of a full-length GBS AI protein. 

14. The immunogenic composition of embodiment 13 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GBS AI protein.

15. The immunogenic composition of embodiment 2 wherein the GBS AI polypeptide is
selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof.

16. The immunogenic composition of embodiment 3 wherein the GBS AI polypeptide is
selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and
fragments thereof.

17. The immunogenic composition of embodiment 15 wherein the GBS AI polypeptide is
GBS 80.

18. The immunogenic composition of any of embodiments 1-3 or 15-17 wherein the 
oligomeric form is a hyperoligomer. 

19. The immunogenic composition of any of embodiments 1-3 or 15-17 further comprising a
Gram positive bacterium antigen not associated with an AI.

20. The immunogenic composition of embodiment 19 wherein the antigen is selected from 
the group consisting of GBS 322 and GBS 276. 

21. The immunogenic composition of embodiment 20 wherein the antigen is GBS 322. 

22. An immunogenic composition comprising a purified Gram positive bacteria adhesin
island (AI) polypeptide in an oligomeric form.

23. The immunogenic composition of embodiment 22 wherein the Gram positive bacteria is
of a genus selected from the group consisting of Streptococcus, Enterococcus, Staphylococcus, or
Listeria.

24. The immunogenic composition of embodiment 23 wherein the Gram positive bacteria is
of the genus Streptococcus.

25. The immunogenic composition of any of embodiments 22-24 wherein the Gram positive 
bacteria AI polypeptide comprises a sortase substrate motif. 

26. The immunogenic composition of embodiment 25 wherein the sortase substrate motif is
an LPXTG motif.

27. The immunogenic composition of any one of embodiments 22-24 wherein the Gram 
positive bacteria AI polypeptide affects the ability of Gram positive bacteria to adhere to epithelial 
cells. 

28. The immunogenic composition of any one of embodiments 22-24 wherein the Gram
positive bacteria AI polypeptide affects the ability of Gram positive bacteria to invade epithelial cells.

29. The immunogenic composition of any one of embodiments 22-24 wherein the Gram 
positive bacteria AI polypeptide affects the ability of Gram positive bacteria to translocate through an 
epithelial cell layer. 

30. The immunogenic composition of any one of embodiments 22-24 wherein the Gram
positive bacteria AI polypeptide is capable of associating with an epithelial cell surface.

31. The immunogenic composition of embodiment 30 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface.

32. The immunogenic composition of any of embodiments 22-24 wherein the Gram positive
bacteria AI polypeptide is a full-length Gram positive bacteria AI protein.

33. The immunogenic composition of any of embodiments 22-24 wherein the Gram positive 
bacteria AI polypeptide is a fragment of a full-length Gram positive bacteria AI protein. 

34. The immunogenic composition of embodiment 33 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the Gram positive bacteria AI protein. 

35. The immunogenic composition of embodiment 24 wherein the genus Streptococcus 
bacteria is Group A Streptococcus (GAS) bacteria and the Gram positive bacteria AI polypeptide is a 
GAS AI polypeptide. 

36. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-1. 

37. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-2. 

38. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-3. 

39. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-4. 

40. The immunogenic composition of any of embodiments 35-39 wherein the GAS AI 
polypeptide comprises a sortase substrate motif. 

41. The immunogenic composition of embodiment 40 wherein the sortase substrate motif is 
an LPXTG motif. 

42. The immunogenic composition of embodiment 41 wherein the LPXTG motif is 
represented by XXXXG, wherein the X at the first amino acid position is an L, a V, an E, or a Q, 
wherein the X at the second amino acid position is P if the X at the first amino acid position is an L, 
the X at the second amino acid position is a V if the X at the first amino acid position is an E or a Q, 
or the X at the second amino acid position is a V or a P if the X at the first amino acid position is a V, 
wherein the X at the third amino acid position is any amino acid residue, and wherein the X at the 
fourth amino acid position is a T if the X at the first amino acid position is a V, an E, or a Q, or the X 
at the fourth amino acid position is a T, an S, or an A if the X at the first amino acid position is an L. 

43. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide affects the ability of GAS bacteria to adhere to epithelial cells. 

44. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide affects the ability of GAS bacteria to invade epithelial cells. 

45. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide affects the ability of GAS bacteria to translocate through an epithelial cell layer. 

46. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide is capable of associating with an epithelial cell surface.

47. The immunogenic composition of embodiment 46 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

48. The immunogenic composition of any of embodiments 35-39 wherein the GAS AI 
polypeptide is a full-length GAS AI protein.

49. The immunogenic composition of any of embodiments 35-39 wherein the GAS AI
polypeptide is a fragment of a full-length GAS AI protein.

50. The immunogenic composition of embodiment 49 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GAS AI protein. 

51. The immunogenic composition of embodiment 36 wherein the GAS AI-1 polypeptide is
selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC SS
410_fimbrial, ISS3650_fimbrial, DSM2071_fimbrial, and fragments thereof.

52. The immunogenic composition of embodiment 37 wherein the GAS AI-2 polypeptide is 
selected from the group consisting of GAS 15, GAS 16, GAS 18, and fragments thereof. 

53. The immunogenic composition of embodiment 38 wherein the GAS AI-3 polypeptide is
selected from the group consisting of SpyM3_0098, SpyM3_0100, SpyM3_0102, SpyM3_0104,
SPs0100, SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126, spyM18_0128,
spyM18_0130, spyM18_0132, SpyoM01000156, SpyoM01000155, SpyoM01000154,
SpyoM01000153, SpyoM01000152, SpyoM01000151, SpyoM01000150, SpyoM01000149,
ISS3040_fimbrial, ISS3776_fimbrial, ISS4959_fimbrial, and fragments thereof.

53. The immunogenic composition of embodiment 39 wherein the GAS AI-4 polypeptide is
selected from the group consisting of 19224134, 19224135, 19224137, 19224139, 19224141,
20010296_fimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial, ISS4538_fimbrial,
and fragments thereof.

54. The immunogenic composition of embodiment 24 wherein the Streptococcus bacteria is
Streptococcus pneumoniae and the Gram positive bacteria AI polypeptide is a S. pneumoniae AI
polypeptide.

55. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide comprises a sortase substrate motif. 

56. The immunogenic composition of embodiment 55 wherein the sortase substrate motif is
an LPXTG motif.

57. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to adhere to epithelial cells. 

58. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to invade epithelial cells. 

59. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI
polypeptide affects the ability of S. pneumoniae to translocate through an epithelial cell layer.

60. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI
polypeptide is capable of associating with an epithelial cell surface.

61. The immunogenic composition of embodiment 60 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

62. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide is a full-length S. pneumoniae AI protein.

63. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI
polypeptide is a fragment of a full-length S. pneumoniae AI protein.

64. The immunogenic composition of embodiment 63 wherein the fragment comprises at
least 7 contiguous amino acid residues of the S. pneumoniae AI protein.

65. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI
polypeptide is selected from the group consisting of SP0462, SP0463, SP0464, orf3_670, orf4_670,
orf5_670, ORF3_14CSR, ORF4_14CSR, ORF5_14CSR, ORF3_19AH, ORF4_19AH, ORF5_19AH,
ORF3_19FTW, ORF4_19FTW, ORF5_19FTW, ORF3_23FP, ORF4_23FP, ORF5_23FP,
ORF3_23FTW, ORF4_23FTW, ORF5_23FTW, ORF3_6BF, ORF4_6BF, ORF5_6BF, ORF3_6BSP,
ORF4_6BSP, ORF5_6BSP, ORF3_9VSP, ORF4_9VSP, ORF5_9VSP, and fragments thereof.

66. The immunogenic composition of any one of embodiments 22-24, 35-39, 51-54, or 65
wherein the oligomeric form is a hyperoligomer.

67. The immunogenic composition of any one of embodiments 22-24, 35-39, 51-54, or 65 
further comprising a Gram positive bacteria antigen not associated with an AI. 

68. The immunogenic composition of embodiment 67 wherein the antigen is selected from
the group consisting of GBS 322 and GBS 276.

69. An immunogenic composition comprising a first and a second Group B Streptococcus 
(GBS) adhesin island (AI) polypeptide. 

70. The immunogenic composition of embodiment 69 wherein a full-length polynucleotide
sequence encoding for the first GBS AI polypeptide is not present in a GBS bacteria genome
comprising a polynucleotide sequence encoding for the second GBS AI polypeptide.

71. The immunogenic composition of embodiment 69 wherein polynucleotides encoding the 
first and the second GBS AI polypeptide are each present in genomes of more than one GBS serotype 
and strain isolate. 

72. The immunogenic composition of embodiment 69 wherein the first GBS AI polypeptide
is encoded by a GBS AI-1.

73. The immunogenic composition of embodiment 69 wherein the first GBS AI polypeptide 
is encoded by a GBS AI-2. 

74. The immunogenic composition of embodiment 72 wherein the second GBS AI 
polypeptide is encoded by a GBS AI-2. 

75. The immunogenic composition of embodiment 73 wherein the second GBS AI
polypeptide is encoded by a GBS AI-2.

76. The immunogenic composition of embodiment 72 wherein the second GBS AI
polypeptide is encoded by a GBS AI-1.

77. The immunogenic composition of embodiment 73 wherein the second GBS AI
polypeptide is encoded by a GBS AI-1.

78. The immunogenic composition of embodiment 72 wherein the first GBS AI polypeptide
is selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof.

79. The immunogenic composition of embodiment 73 wherein the first GBS AI polypeptide
is selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and
fragments thereof.

80. The immunogenic composition of embodiment 74 or 75 wherein the second GBS AI
polypeptide is selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523,
01524, and fragments thereof, and wherein the first and the second GBS AI polypeptide are not the
same polypeptide.

81. The immunogenic composition of embodiment 76 or 77 wherein the second GBS AI 
polypeptide is selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments 
thereof, and wherein the first and the second GBS AI polypeptide are not the same polypeptide. 

82. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS
AI polypeptide comprises a sortase substrate motif.

83. The immunogenic composition of embodiment 82 wherein the sortase substrate motif is 
an LPXTG motif. 

84. The immunogenic composition of embodiment 83 wherein the LPXTG motif is
represented by the sequence XPXTG, wherein the X at amino acid position 1 is an L, an I, or an F and
the X at amino acid position 3 is any amino acid residue.

85. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide affects the ability of GBS bacteria to adhere to epithelial cells. 

86. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS
AI polypeptide affects the ability of GBS bacteria to invade epithelial cells.

87. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide affects the ability of GBS bacteria to translocate through an epithelial cell layer. 

88. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide is capable of associating with an epithelial cell surface. 

89. The immunogenic composition of embodiment 88 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

90. The immunogenic composition of any of embodiments 69-77 wherein the first GBS AI 
polypeptide is a full-length GBS AI protein. 

91. The immunogenic composition of any of embodiments 69-77 wherein the first GBS AI
polypeptide is a fragment of a full-length GBS AI protein.

92. The immunogenic composition of embodiment 91 wherein the fragment comprises at
least 7 contiguous amino acid residues of the first GBS AI protein.

93. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS
AI polypeptide is in oligomeric form.

94. The immunogenic composition of any one of embodiments 69-77 wherein the second 
GBS AI polypeptide is in oligomeric form. 

95. The immunogenic composition of any one of embodiments 69-79 wherein the first and 
the second GBS AI polypeptide are associated in a single oligomeric form. 

96. The immunogenic composition of embodiment 95 wherein the first and the second GBS 
AI polypeptides are chemically associated. 

97. The immunogenic composition of embodiment 95 wherein the first and the second GBS 
AI polypeptides are physically associated. 

98. The immunogenic composition of embodiment 93 wherein the oligomeric form is a 
hyperoligomer. 

99. The immunogenic composition of embodiment 94 wherein the oligomeric form is a 
hyperoligomer. 

100. The immunogenic composition of embodiment 76 wherein the first GBS AI polypeptide 
is GBS 80 and the second GBS AI polypeptide is GBS 104. 

101. The immunogenic composition of embodiment 74 wherein the first GBS AI polypeptide 
is GBS 80 and the second GBS AI polypeptide is GBS 67. 

102. The immunogenic composition of any one of embodiments 69-79, 100, or 101 further 
comprising a GBS polypeptide not associated with an AI. 

103. The immunogenic composition of embodiment 102 wherein the GBS polypeptide not 
associated with an AI is selected from the group consisting of GBS 322 and GBS 276. 

104. The immunogenic composition of embodiment 103 wherein the GBS polypeptide not 
associated with an AI is GBS 322. 

105. An immunogenic composition comprising a first and a second Gram positive bacteria 
adhesin island (AI) polypeptide. 

106. The immunogenic composition of embodiment 105 wherein a full length polynucleotide 
sequence encoding for the first Gram positive bacteria AI polypeptide is not present in a genome of a 
Gram positive bacteria comprising a full length polynucleotide sequence encoding for the second 
Gram positive bacteria AI polypeptide. 

107. The immunogenic composition of embodiment 105 wherein polynucleotides encoding 
the first and the second Gram positive bacteria AI polypeptide are each present in genomes of more 
than one Gram positive bacteria serotype and strain isolate. 

108. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are of different Gram positive bacteria species. 

109. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are of the same Gram positive bacteria species.

110. The immunogenic composition of embodiment 105 wherein the first and the second
Gram positive bacteria AI polypeptides are from different AI subtypes.

111. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are from the same AI subtype. 

112. The immunogenic composition of embodiment 105 wherein the first Gram positive 
bacteria AI polypeptide has detectable surface exposure on a first Gram positive bacteria strain or 
serotype but not a second Gram positive bacteria strain or subtype and the second Gram positive
bacteria AI polypeptide has detectable surface exposure on the second Gram positive bacteria strain or 
serotype but not the first Gram positive bacteria strain or serotype. 

113. The immunogenic composition of embodiment 105 wherein the Gram positive bacteria
is S. pneumoniae, S. mutans, E. faecalis, E. faecium, C. difficile, L. monocytogenes, or C. diphtheriae.

114. The immunogenic composition of any of embodiments 105-113 wherein the first and 
the second Gram positive bacteria AI polypeptides comprise a sortase substrate motif. 

115. The immunogenic composition of embodiment 114 wherein the sortase substrate motif 
is an LPXTG motif. 

116. The immunogenic composition of embodiment 115 wherein the LPXTG motif is 
represented by XXXXG, wherein the X at amino acid position 1 is an L, a V, an E, an I, an F, or a Q, 
wherein X at amino acid position 2 is a P if X at amino acid position 1 is an L, an I, or an F, wherein 
X at amino acid position 2 is a V if X at amino acid position 1 is a E or a Q, wherein X at amino acid 
position 2 is a V or a P if X at amino acid position 1 is a V, wherein X at amino acid position 3 is any 
amino acid residue, wherein X at amino acid position 4 is a T if X at amino acid position 1 is a V, E, 
I, F, or Q, and wherein X at amino acid position 4 is a T, S, or A if X at amino acid position 1 is an L. 

117. The immunogenic composition of embodiment 105 wherein the first Gram positive 
bacteria AI polypeptide is a first Group A Streptococcus (GAS) AI polypeptide. 

118. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide comprises a sortase substrate motif. 

119. The immunogenic composition of embodiment 118 wherein the sortase substrate motif 
is an LPXTG motif. 

120. The immunogenic composition of embodiment 119 wherein the LPXTG motif is 
represented by XXXXG, wherein the X at the first amino acid position is an L, a V, an E, or a Q, 
wherein the X at the second amino acid position is P if the X at the first amino acid position is an L, 
the X at the second amino acid position is a V if the X at the first amino acid position is an E or a Q, 
or the X at the second amino acid position is a V or a P if the X at the first amino acid position is a V, 
wherein the X at the third amino acid position is any amino acid residue, and wherein the X at the 
fourth amino acid position is a T if the X at the first amino acid position is a V, an E, or a Q, or the X 
at the fourth amino acid position is a T, an S, or an A if the X at the first amino acid position is an L. 

121. The immunogenic composition of embodiment 117 wherein the first GAS AI
polypeptide affects the ability of GAS bacteria to adhere to epithelial cells.

122. The immunogenic composition of embodiment 117 wherein the first GAS AI
polypeptide affects the ability of GAS bacteria to invade epithelial cells.

123. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide affects the ability of GAS bacteria to translocate through an epithelial cell layer. 

124. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is capable of associating with an epithelial cell surface. 

125. The immunogenic composition of embodiment 117 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

126. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a full-length GAS AI protein. 

127. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a fragment of a full-length GAS AI protein. 

128. The immunogenic composition of embodiment 127 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GAS AI protein. 

129. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-1 polypeptide. 

130. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-2 polypeptide. 

131. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-3 polypeptide. 

132. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-4 polypeptide. 

133. The immunogenic composition of any one of embodiments 117 or 129-132 wherein the 
second Gram positive bacteria AI polypeptide is a second GAS AI polypeptide. 

134. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-1 polypeptide. 

135. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-2 polypeptide. 

136. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-3 polypeptide. 

137. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-4 polypeptide. 

138. The immunogenic composition of embodiment 129 wherein the first GAS AI-1
polypeptide is selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC
SS 410_fimbrial, ISS3650_fimbrial, DSM2071_fimbrial, and fragments thereof.

139. The immunogenic composition of embodiment 130 wherein the first GAS AI-2 
polypeptide is selected from the group consisting of GAS 15, GAS 16, GAS 18, and fragments thereof.

140. The immunogenic composition of embodiment 131 wherein the first GAS AI-3
polypeptide is selected from the group consisting of SpyM3_0098, SpyM3_0100, SpyM3_0102,
SpyM3_0104, SPs0100, SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126,
spyM18_0128, spyM18_0130, spyM18_0132, SpyoM01000156, SpyoM01000155, SpyoM01000154,
SpyoM01000153, SpyoM01000152, SpyoM01000151, SpyoM01000150, SpyoM01000149,
ISS3040_fimbrial, ISS3776_fimbrial, ISS4959_fimbrial, and fragments thereof.

141. The immunogenic composition of embodiment 132 wherein the first GAS AI-4
polypeptide is selected from the group consisting of 19224134, 19224135, 19224137, 19224139,
19224141, 20010296_fimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial,
ISS4538_fimbrial, and fragments thereof.

142. The immunogenic composition of embodiment 134 wherein the second GAS AI-1
polypeptide is selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC
SS 410_fimbrial, ISS3650_fimbrial, DSM2071_fimbrial, and fragments thereof.

143. The immunogenic composition of embodiment 135 wherein the second GAS AI-2 
polypeptide is selected from the group consisting of GAS 15, GAS 16, GAS 18, and fragments thereof. 

144. The immunogenic composition of embodiment 136 wherein the second GAS AI-3
polypeptide is selected from the group consisting of SpyM3_0098, SpyM3_0100, SpyM3_0102,
SpyM3_0104, SPs0100, SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126,
spyM18_0128, spyM18_0130, spyM18_0132, SpyoM01000156, SpyoM01000155, SpyoM01000154,
SpyoM01000153, SpyoM01000152, SpyoM01000151, SpyoM01000150, SpyoM01000149,
ISS3040_fimbrial, ISS3776_fimbrial, ISS4959_fimbrial, and fragments thereof.

145. The immunogenic composition of embodiment 137 wherein the second GAS AI-4
polypeptide is selected from the group consisting of 19224134, 19224135, 19224137, 19224139,
19224141, 20010296_fimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial,
ISS4538_fimbrial, and fragments thereof.

146. The immunogenic composition of any one of embodiments 117-132 or 138-141 wherein
the second Gram positive bacteria AI polypeptide is a Group B Streptococcus (GBS) AI polypeptide.

147. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide 
comprises a sortase substrate motif. 

148. The immunogenic composition of embodiment 147 wherein the sortase substrate motif 
is an LPXTG motif. 

149. The immunogenic composition of embodiment 148 wherein the LPXTG motif is 
represented by the amino acid sequence XPXTG, wherein the X at amino acid position 1 is an L, an I, 
or an F and the X at amino acid position 3 is any amino acid residue. 

150. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide 
affects the ability of GBS bacteria to adhere to epithelial cells. 

151. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide
affects the ability of GBS bacteria to invade epithelial cells.

152. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide
affects the ability of GBS bacteria to translocate through an epithelial cell layer.

153. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is 
capable of associating with an epithelial cell surface. 

154. The immunogenic composition of embodiment 146 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

155. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is 
a full-length GBS AI protein.

156. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is 
a fragment of a full-length GBS AI protein.

157. The immunogenic composition of embodiment 156 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GBS AI protein. 
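
The "at least 7 contiguous amino acid residues" condition of embodiment 157 amounts to a shared-substring test. The sketch below shows one way to check it, under stated assumptions: the function name `shares_contiguous_run`, the default threshold argument, and the toy sequences are hypothetical and are not taken from this document.

```python
def shares_contiguous_run(candidate: str, protein: str, min_len: int = 7) -> bool:
    """Return True if candidate contains at least min_len contiguous residues
    that also occur contiguously in protein (exact, case-insensitive match)."""
    cand, prot = candidate.upper(), protein.upper()
    if len(cand) < min_len or len(prot) < min_len:
        return False
    # Collect every window of min_len residues from the reference protein.
    protein_windows = {prot[i:i + min_len] for i in range(len(prot) - min_len + 1)}
    # The candidate qualifies if any of its windows appears in that set.
    return any(
        cand[i:i + min_len] in protein_windows
        for i in range(len(cand) - min_len + 1)
    )


if __name__ == "__main__":
    # Hypothetical toy sequences used only to exercise the function.
    reference = "MKTAYIAKQRQISFVK"
    print(shares_contiguous_run("GGAYIAKQRGG", reference))  # shares AYIAKQR (7 residues)
    print(shares_contiguous_run("GGAYIAKQGG", reference))   # longest shared run is 6
```

Any run longer than 7 residues necessarily contains a 7-residue window, so testing windows of exactly `min_len` is sufficient.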

158. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is 
a GBS AI-1 polypeptide. 

159. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is
a GBS AI-2 polypeptide.

160. The immunogenic composition of embodiment 158 wherein the GBS AI-1 polypeptide 
is selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof. 


161. The immunogenic composition of embodiment 159 wherein the GBS AI-2 polypeptide
is selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and
fragments thereof.

162. The immunogenic composition of any one of embodiments 117-132 or 138-141 wherein 
the second Gram positive bacteria AI polypeptide is a Streptococcus pneumoniae AI polypeptide. 

163. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI 
polypeptide comprises a sortase substrate motif.

164. The immunogenic composition of embodiment 163 wherein the sortase substrate motif 
is an LPXTG motif. 

165. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to adhere to epithelial cells. 

166. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI polypeptide
affects the ability of S. pneumoniae to invade epithelial cells.

167. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide affects the ability of S. pneumoniae to translocate through an epithelial cell layer. 

168. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI 
polypeptide is capable of associating with an epithelial cell surface.

169. The immunogenic composition of embodiment 168 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

170. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide is a full-length S. pneumoniae AI protein.

171. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide is a fragment of a full-length S. pneumoniae AI protein.

172. The immunogenic composition of embodiment 162 wherein the fragment comprises at
least 7 contiguous amino acid residues of the S. pneumoniae AI protein.

173. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI polypeptide 
is selected from the group consisting of SP0462, SP0463, SP0464, orf3_670, orf4_670, orf5_670,
ORF3_14CSR, ORF4_14CSR, ORF5_14CSR, ORF3_19AH, ORF4_19AH, ORF5_19AH,
ORF3_19FTW, ORF4_19FTW, ORF5_19FTW, ORF3_23FP, ORF4_23FP, ORF5_23FP,
ORF3_23FTW, ORF4_23FTW, ORF5_23FTW, ORF3_6BF, ORF4_6BF, ORF5_6BF, ORF3_6BSP,
ORF4_6BSP, ORF5_6BSP, ORF3_9VSP, ORF4_9VSP, ORF5_9VSP, and fragments thereof.

174. The immunogenic composition of any one of embodiments 105-117 wherein the first 
Gram positive bacteria AI polypeptide is in oligomeric form. 

175. The immunogenic composition of embodiment 174 wherein the oligomeric form is a
hyperoligomer.

176. The immunogenic composition of embodiment 174 wherein the second Gram positive 
bacteria AI polypeptide is in oligomeric form. 

177. The immunogenic composition of embodiment 176 wherein the oligomeric form is a 
hyperoligomer.

178. The immunogenic composition of embodiment 176 wherein the first and the second 
Gram positive bacteria AI polypeptide are associated in a single oligomeric form. 

179. The immunogenic composition of embodiment 178 wherein the first and the second 
Gram positive bacteria AI polypeptide are chemically associated. 

180. The immunogenic composition of embodiment 178 wherein the first and the second
Gram positive bacteria AI polypeptide are physically associated.

181. The immunogenic composition of any one of embodiments 105-117 further comprising
a Gram positive bacteria polypeptide not associated with an AI. 

182. The immunogenic composition of embodiment 181 wherein the Gram positive bacteria 
polypeptide not associated with an AI is selected from the group consisting of GBS 322 and GBS 276.

183. The immunogenic composition of embodiment 182 wherein the Gram positive bacteria 
polypeptide not associated with an AI is GBS 322. 

184. A modified Gram positive bacterium adapted to produce increased levels of AI surface
protein.

185. The modified Gram positive bacterium of embodiment 184 wherein the AI surface
protein is in oligomeric form.

186. The modified Gram positive bacterium of embodiment 185 wherein the oligomeric form
is a hyperoligomer.

187. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
Group B Streptococcus bacterium.

188. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
Group A Streptococcus bacterium.

189. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
non-pathogenic Gram positive bacterium.

190. The modified Gram positive bacterium of embodiment 189 wherein the non-pathogenic 
Gram positive bacterium is Streptococcus gordonii.

191. The modified Gram positive bacterium of embodiment 189 wherein the non-pathogenic 
Gram positive bacterium is Lactococcus lactis.

192. The modified Gram positive bacterium of any one of embodiments 184-186 which has 
been inactivated and wherein the AI surface protein is exposed on the surface of the Gram positive 
bacterium. 

193. The modified Gram positive bacterium of any one of embodiments 184-186 which has 
been attenuated and wherein the AI surface protein is exposed on the surface of the Gram positive
bacterium.

194. The modified GBS bacterium of embodiment 187 which has been inactivated and 
wherein the AI surface protein is exposed on the surface of the GBS bacterium. 

195. The modified GBS bacterium of embodiment 187 which has been attenuated and 
wherein the AI surface protein is exposed on the surface of the GBS bacterium.

196. The modified GAS bacterium of embodiment 188 which has been inactivated and 
wherein the AI surface protein is exposed on the surface of the GAS bacterium. 

197. The modified GAS bacterium of embodiment 188 which has been attenuated and 
wherein the AI surface protein is exposed on the surface of the GAS bacterium. 

198. The modified non-pathogenic bacterium of embodiment 189 which has been inactivated
and wherein the AI surface protein is exposed on the surface of the non-pathogenic Gram positive
bacterium.

199. The modified non-pathogenic bacterium of embodiment 189 which has been attenuated 
and wherein the AI surface protein is exposed on the surface of the non-pathogenic Gram positive
bacterium.

200. A method for manufacturing an oligomeric adhesin island (AI) surface antigen 
comprising: 

culturing a Gram positive bacterium that expresses an oligomeric AI surface antigen and
isolating the expressed oligomeric AI surface antigen.

201. The method of embodiment 200 wherein the step of isolating is performed by collecting
said oligomeric AI surface antigen from Gram positive bacterium secretions in the Gram positive
bacterium culture.

202. The method of embodiment 200 further comprising a step of purifying.

203. [...] from the Gram positive bacterium cell surface.

204. The method of embodiment 200 wherein the Gram positive bacterium is adapted for
increased AI protein expression.

205. The method of any one of embodiments 200-204 wherein the Gram positive bacterium
is a Group A Streptococcus bacterium.

206. The method of any one of embodiments 200-204 wherein the Gram positive bacterium 
is a Group B Streptococcus bacterium. 

207. The method of any one of embodiments 200-204 wherein the oligomeric AI surface 
antigen is in hyperoligomeric form.

208. The method of embodiment 200 wherein the Gram positive bacterium expresses the 
oligomeric AI surface antigen recombinantly. 

209. The method of embodiment 208 wherein the Gram positive bacterium is further
manipulated to express at least 1 AI sortase.

210. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
S. pneumoniae bacterium.

211. The method of any one of embodiments 200-204 wherein the Gram positive bacterium
is S. pneumoniae.

1. An immunogenic composition comprising a purified Group B Streptococcus (GBS)
adhesin island (AI) polypeptide in oligomeric form.

2. The immunogenic composition of claim 1 wherein the GBS AI polypeptide is selected
from a GBS AI-1.

3. The immunogenic composition of claim 1 wherein the GBS AI polypeptide is selected 
from a GBS AI-2.

4. The immunogenic composition of claim 2 wherein the GBS AI polypeptide is selected 
from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof. 

5. The immunogenic composition of claim 3 wherein the GBS AI polypeptide is selected
from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and fragments
thereof.

6. The immunogenic composition of claim 4 wherein the GBS AI polypeptide is GBS 80. 

7. The immunogenic composition of any of claims 1-6 wherein the oligomeric form is a 
hyperoligomer.

8 (22). An immunogenic composition comprising a purified Gram positive bacteria adhesin 
island (AI) polypeptide in an oligomeric form. 

9 (23). The immunogenic composition of claim 8 wherein the Gram positive bacteria is of a 
genus selected from the group consisting of Streptococcus, Enterococcus, Staphylococcus,
Clostridium, Corynebacterium, or Listeria.

10 (24). The immunogenic composition of claim 9 wherein the Gram positive bacteria is of
the genus Streptococcus.

11 (35). The immunogenic composition of claim 10 wherein the genus Streptococcus 
bacteria is Group A Streptococcus (GAS) bacteria and the Gram positive bacteria AI polypeptide is a
GAS AI polypeptide.

12 (36). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is 
selected from a GAS AI-1. 

13 (37). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is 
selected from a GAS AI-2. 

14 (38). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is
selected from a GAS AI-3.

15 (39). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is 
selected from a GAS AI-4. 

16 (66). The immunogenic composition of any one of claims 8-15 wherein the oligomeric 
form is a hyperoligomer.

17. An immunogenic composition comprising a first and a second Group B Streptococcus
(GBS) adhesin island (AI) polypeptide.

18. The immunogenic composition of claim 17 wherein the first GBS AI polypeptide is
encoded by a GBS AI-1.

19. The immunogenic composition of claim 18 wherein the second GBS AI polypeptide is 
encoded by a GBS AI-2. 

20. The immunogenic composition of claim 18 wherein the first GBS AI polypeptide is
selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof.

21. The immunogenic composition of claim 19 wherein the second GBS AI polypeptide is 
selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and 
fragments thereof, and wherein the first and the second GBS AI polypeptide are not the same
polypeptide.

22. The immunogenic composition of claim 19 wherein the first GBS AI polypeptide is GBS 
80 and the second GBS AI polypeptide is GBS 67. 

23. An immunogenic composition comprising a first and a second Gram positive bacteria 
adhesin island (AI) polypeptide. 

24. The immunogenic composition of claim 23 wherein the Gram positive bacteria is
Streptococcus, Enterococcus, Staphylococcus, Clostridium, Corynebacterium, or Listeria.

25. The immunogenic composition of claim 23 wherein the first Gram positive bacteria AI 
polypeptide is a first Group A Streptococcus (GAS) AI polypeptide. 

26. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a 
first GAS AI-1 polypeptide.

27. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a 
first GAS AI-2 polypeptide. 

28. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a 
first GAS AI-3 polypeptide. 

29. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a
first GAS AI-4 polypeptide.

30. The immunogenic composition of any one of claims 25-29 wherein the second Gram 
positive bacteria AI polypeptide is a second GAS AI polypeptide. 

31. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a 
second GAS AI-1 polypeptide.

32. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a 
second GAS AI-2 polypeptide. 

33. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a 
second GAS AI-3 polypeptide. 

34. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a
second GAS AI-4 polypeptide.

35. A modified Gram positive bacterium adapted to produce increased levels of AI surface
protein.

36. The modified Gram positive bacterium of claim 35 wherein the AI surface protein is in
oligomeric form.

37. The modified Gram positive bacterium of claim 36 wherein the oligomeric form is a 
hyperoligomer. 

38. The modified Gram positive bacterium of any one of claims 35-37 which is a
non-pathogenic Gram positive bacterium.

39. The modified Gram positive bacterium of claim 38 wherein the non-pathogenic Gram 
positive bacterium is Lactococcus lactis. 

40. A method for manufacturing an oligomeric adhesin island (AI) surface antigen 
comprising: 

culturing a Gram positive bacterium that expresses an oligomeric AI surface antigen and
isolating the expressed oligomeric AI surface antigen.

[Figure: nucleotide sequence alignment of 2603_ai1.seq, 18rs21_ai1.seq, coh1_ai1.seq, cjb111_ai1.seq, nem316_ai1.seq and a909_ai1.seq against the Majority consensus; alignment positions preceding 701. Sequence rows omitted.]

[FIGURE 18 C: Alignment Report, using J. Hein method with Weighted residue weight table. Nucleotide sequence alignment of 2603_ai1.seq, 18rs21_ai1.seq, coh1_ai1.seq, cjb111_ai1.seq, nem316_ai1.seq and a909_ai1.seq against the Majority consensus; ruler positions 710 through 1400. Sequence rows omitted.]

j^p^^,,, WO 2006/078318 

mgnmetrt Heport of AM_aitgnmeni, using J. netn method with Weighted residue weight table 
hursday, Jufy 29. 2004 5:46 „Pf fl 




PCT/US2005/027239 
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r A A T C tgCTff gWg^gr^g TjKg rf- A »fr.e»X G%C GGTAACTCAAAAC AGTTTT Majority 



Page 5 



450 
219 
451 
450 
451 
451 



5O0 
269 
501 
SGO 
501 
SOI 



550 
U9 
55* 
550 
551 
551 



JOO 
(69 

>oi 

JOO 
KH 
>01 



A A 



1410 



1420 



1430 



1440 



1450 



400 AATAATGTCATTGAGATCGTTTAAAACAGTGGGTAACTCAAAAGAGTTTT 2603„ail seq 
169 AATAATGTGATTGAG. ATGGTTTAAAACAGTGGGTAACTGAAAAGAGTTTT 18rs21_all seq 

401 AATAATGTGATTCAGATGGTTTAAAACAGTGGGTAACTCAAAAGAGTTTT eohl__ail seq 

400 AATAATGTGATTGAGATGGTTTAAAACACTGGGTAACTGAAAAGAGTTTT cjblll_ail seq 

401 AATAATGTGATTGAGATGCTTTAAAACAGTGGGTAACTGAAAAGAGTTTT neai3i6_ail seq 
401 AATAATGTGATTGAG A T.GCTTTAAAACACTCCCTAACTCAAA ACAGTTTT a909_all.seq 

TCTTAGTATGTTTTAGGTGAAGAACAATATCAGGATCCGCAACAATCTGT Majority 

* 1 " 1 i ~ 1 • 1 — r J 

1460 1470 1480 1490 1SOO 
J J 1 1 t 



150 
119 
151 
150 
151 
151 



CO 
£9 
Ol 
XX) 
Ol 
Ol 



TCTTAGTATGT TT TAGGTGAAGAACAATATCAGGAT C.C GCAACAATCTGT 2603_ail seq 

TCTTACTATCTTTTAGGTGAAGAACAATATCAGGATCCGCAACAATCTGT 18rs2i_ail seq 

TCTTAGTATGTTTTAGGTGAAGAACAATATCAGGATCCGCAACAATCTGT cohl_aiKseq 

TCTTAGTATGTTTTAGGTGAAGAACAATATCAGGATCC GCAACAATCTGT cjblll.ail seq 

TCTTACTATGTTTTAGGTGAAGAACAATATCAGGATeCGCAACAATCTGT ne^lG^aiKseq 

TCTTAGTATGT T .T T A G GTGAAGAACAATATCAGGATCCCCAACAATCTG T a909_all.seq 

TCTGACTCTTC TAATAAATGATTCATGGCTTGTTGGCAACTAGCCTCAAA Majority 



1510 
— i 



1520 

-L. 



TCTGACTCTTCTAATAAATGATTGATG 
TCTG ACTCTTCTAATAAATGATTGATG 
T C T GACTCTTCTAATAAATGATTGATG 



1530 
i 



1540 



1550 



CTTGTTGGCAACT. AGCCTCAAA 2603_ail.seq 
C TTGTTGGCAACTA G.C CTCA A A 18rs21_ail.seq 
,,«C TTGTTGGCAA C T AGCCTCAAA cohl_all.seq 
TCTGACTCTT.CTAATAAATGATTCATGGCTTGTTGGCAACTACCCTCAAA cjblll all seq 
T C.T G A C.T C -T.T CTAA.TAAATG ATT G A TGGCTTGTTGGCAACTAGCCTCA AA riemSielail seq 
TCTGACTCTTCTAATAA.ATGATTGATGGCTTCTTGGCAACTAGCCTCAAA a909_all seq 



CTCTC TTTGGAAAAAGGCATCGATAGACACAAGAAGACTACGTATACTGG Majority 

1590 1600 



1560 



i 

1570 



1580 



C T G T G T T T.G G A A A A A G G C A T C G ATA C A C AC A A G A A G ACT A CC TA T A C T C G 2603_all.seq 

CTGTGTTTGGAAAAAGGCATCGATAGACACAAGAAGACTACG T A T A C T G G 18rs21_ail seq 

CTGT GTTTGGAAAA AGGCATC GAT AG A C A C A A G A A G A C T A C G T A T A C T G G cohl_ail seq 

C T G T G T T T G G A A A A A G G C A T C G.A T A G A C A C A A C A A G A C T A C G T A T A C T G G cjblll_ail seq 

C T G T.G T T T G G A A A A A G G C A T C G A T A G A C A C A A G A A G A C T A C G T A T A C T G G nea316_ail seq 

CTCTCTTTGGAAAAAGGCATCCATAGACACAAGAAGACTACGTATACTCG a909_ail.seq. 

TAGT ACCAAAACAAGGCACAAGCTTTATATAGGATAACATTTCTTTTTTA Majority 

1610 1620 1630 1640 1650 

■ ' : 1 1 L_l i l_ 

TAGTAGGAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTT T T T T A 26031ail.seq 

TAGTAGGAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTTTTTTA 18rs21_ail seq 

TA.GTAGGAAAACAAGGGACAAGCTTT'ATATAGGATAAGATTTCTTTTTTA cohl_aii seq 

TAGTAGCAAAACAAGGGACAAGCTTTATATAGGATAAG.ATTTCTTTTTTA ejblil„ail.seq 

TAGTAGGAAAACAAGGGACAAGCTTTATATAGGAT A AG ATTTCTTTTTTA nem316_ail.seq 

TAGTAGGAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTTTTTTA a909_aii.seq 

CTACGATGAC AAAATTGTTCTACAAAGCGACTGCATAACTGTTCTT G. C C T Majority 



1660 



1670 



1680 
JL 



1690 
a. 



1700 



T A C GAT G A G A A A A" T T G T T C T. A G A. A AG C G A C.T G G A- T A A C T G T T CTTfiCCT 2603_ail seq 

G T.A C G A T G A G> A A A ti G T T C T A G A A. A.X5 C G A G T G G A T A A C T.G f t C T T GC C T i8rs2l^ail seq 

C T A C C A T C16 A AAA T T G T T C T A G A A A G C G A C T G G .A t.A A'CT.C TT.C T T G C C T cohi_jait seq 

CT A C -CATC A (I A AAA T.T fit. TP.T A fi A A A fi n n.A n tt h. n » • *. * n * .ti ' k> » r» V 4- r* ^ i, 

G T.A C. 
0 TA C 

ATT q A T A t C A C GGC TAT A G G G A T A A A A T G G T C C A A T A; G.C A A T A A.G' A.T A T:T' Maj ori ty 

' • _ 



T 

1710 
_1 



1720 



1730 

JL. 



1740 

JL 



1750 



A T T G A T A T C A G G G C T A T A G G G A T A A A A T G G T C C A A T A C C A ATA A G.A I ATT 2603iail.seq 
ATT GATATCACGGCTAT AG G G AT A A AA T G G T G C A A T A G C A A T A A G A T A T T 18rs21_aU.seq 
ATT G ATATCA.GG'GCT AT A GG G A T A A A A T GG TCGAATAGCA ATAAG ATATT eohLail.seq 
ATTGATATCAGGGCTATAGGGATAAAATGGTCCAATAGCAATAAGATATT cjblll_aU.seq 
A T.T G A T A T C A G G G C T A T A G G C A T A A A A T G G T C C A A T A G C A A T A A G A T A T T Ae*316_an .seq 
A'T TGATATCAGGG C*T AT A GGG AT AAA AT GGTCCAA T-A GCAATAAG.ATAT'T a909_ail.seq 
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a^t 



"h ""H* '""l! '"" h ji'"}t 



[Alignment panels, positions 1751 to 2100: Majority consensus with rows 2603_ail.seq, 18rs21_ail.seq, coh1_ail.seq, cjb111_ail.seq, nem316_ail.seq and a909_ail.seq. The sequence rows were scattered column by column in extraction and are not reproduced here.]



FIGURE 18 E





Alignment Report of AI-1 [...] with Weighted residue weight table    WO 2006/078318    PCT/US2005/027239
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C A ifr-fltT. CliT„€ A J fclAfl X:!A.,T X:a.,tr fllff -tiW TATTTATATCICA AATAAACC Majority 



T T C A T 



2110 2120 2130 2140 2150 

2100 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG 2603_ail.seq
1869 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG 18rs21_ail.seq
2101 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG coh1_ail.seq
2100 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG cjb111_ail.seq
2101 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGC nem316_ail.seq
2101 TTCATCATCTCTCACCATTATTATACTACTATTTATATCACAAATAAAGC a909_ail.seq

TGAfTT.TGTTAAAAATATAACTTTCAAAATCCACATATATTTTTAATCTT Majority 

2160 2170 2180 2190 2200 



2150 T G A T T T T G T T A A A AHt A T A A C T T T G A A A A T- C C A C A T A T A T T T T T A A T C T T 2603_ail.seq
1919 ^ATTTTGTTAAAAHTATAAGTTTGAAAATCCACATATATTTTTAATCTT 18rs21_ail.seq
2151 Til A TTTTGTTAAA AH TAT AACTTTG AAA ATCCACATATATT T T T A A T C T T coh1_ail.seq
2150 T C A T TT TGTTAA AAATAT AACTTTGAA AATCCACAT ATATTTTTAATCTT cjb111_ail.seq
2151 TGATTTTGTTAAAAATATAACTTTGAAAATC C AC AT AT ATTTTTAATCTT nem316_ail.seq
2151 TGAT.TTTGTTAAAAATATAACTTTGAAAATCCACATATATTTTTAATCTT a909_ail.seq

C C. C T C T G A A A A A A. - T A A A T A A A A A T A G T A A A A A T A A A G A C G A A T T T A A A A Majori ty 

2210 2220 2230 2240 2250 

5199 C C GTCTCAAAAAA-TAAATAAAAATAGTAAAAATAAACACGAATTTAAAA 2603„ail.seq 

1968 C C G T C T G AAAAAA-TAAATAAAAATAGTAAAAAT.AAACACGAATTTAAAA 18rs21_all.seq 

200 C C G T C T C A A A A A A - T A A A T A A A A A T A G T A A A A A T A A A C A CG A A TT T A A A A cohl_ail,seq 

200 CCCTCTGAAAAAA -TAAATAAAAATAG T A A A A A T A A. AC ACGAATTTAAAA cjbUl_aii.seq 

201 CCGTCTGAA AA A aQt A A A T A A A A A T AG T A A A A AT A A A C A C GA ATTTAAAA nea316_ail.seq 
201 C C G T C T G A A A A A A - T A A A T A A A A A T A GT A A A A A T A AAC ACGAATTTAAAA a909„ail.seq 

T A A G C A A A T TT TTTAAGAAAATCTGTGCTAAACTTTAATA CT TTTGTGCT Majority 

2260 2270 2280 2290 2300 

248 T A A G C A A A T T T T T T A A G A A A A T C T G T G C T A A A C T T. T A A T A G T T T T G T G C T 2603_all.seq 
»17 T A A G.C.A A ATTTTTTA AGAAAATCTGTGCT A A AC TTTAATAGTTTTGTGCT 18rs2i_ail.seq 

249 TAAGCAAATTTTTTAAGAAA. ATCTGTGCTAAACTTTAATAGTTT T G T G C t cohi_ail.seq 
249^ TAAGCAAATTTTTTA A CAAAATCTGTGCTAAAC TTTAATAGTTTTGTGCT cjblU^ail.seq 
251 T A AC C A A AT T T t T T A A G A A A ATCTGTGCTAAACTTTAATAGTTTTGTGCT nem316_all.seq 

250 TAAGCAAATTTTTT A A G AAA. ATCTGTGCTAAACTTTAATAGTTTTGTGCT a909_ail.seq 

T A A TAATAATCAGCACTTACAAAGAACAAACGGA A A A G C G AG GACAGAAC Majority 

2310 2320 2330 2340 2350 

298 TAATAATAATCAGCA.CTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC. 2603_ail.seq 
067 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC 18rs21_jaiKseq 

299 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCCAGGAGAGAAC oohl_ail.seq 

299 T A A T A A T A A T C A G C A C T T A CA A A G A A C A A A G G G A A A A G C G A G G A G A G A A C cjbill_ail.seq 
301 T A A T A A T A A T C A G C A C T T A'C A A A G A A CA A A G G.G A A A A G C G A G G A G A G A A C nem316_ail.seq 

300 T A AT A A T A-A T CAGCACTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC a909_aiKseq ' . 

TTTTAATGAAATTATCG AAGAAGTTATT.GTTTT CG G C T. GCTGTTTTAACA Majority * 


2360 - • 2370 . 2380 2390 2400 

348 T T T T A A T G A A A T T A T C G A A G A A (?.T'T ATT GTT.TTCGG C T G C T G t TT T A A C A 2603_ail.seq . . 
117. TTTTA A T;G A A A T TAT CG A A G A A G XT A T-T G T TT T C !G G ^ T.G C ±G T i T T A A C A 18rs2li.aii.ieq - 

349 TTTTA A T G A A A T T A T C G A A G A-A G T -T A T T £ T T t T C G/G C T G G T G T T T T A A C A. isohUaM.seq 

349 T T T T-A" A ^ *^ A; & A T T" "* t r r. i A h a -i h t.* *.**r -f i^'*p."*r = "«p t- &'**' 'i\.-dfr *r m is' »i» V «i» ' ** 1 r»--'« /u**-*'i'k- *— ■ * \L • 

351 TT-TT.AA 

350 t - T T T A A 


AT G G T G G C G. G G G T C A A : C T G. T, T G/ A A h C A G: : T 1 A 'G C T. C A G^T T. T : G C C ACT G. G K- kTX. Majority 

2410 2420 2430 2440 2450 



398 
167 

399 



ATGGTCG C G G G G T CA A C TG T T.G A A fc.C'A C TAG C T € AG TTTGC GA.C T G G A A f 2603_ail.seq 
ATG GTGGCGQG GTCAAC T GT T G A. A C C ACTA GCTC AGTTf GGG ACTGG A AT 18rs2Uall seq . 
A T G G T G G.CG'G G-GTCAAC'T-GTTCAAC G AGT A G CTCAGTTT G-CGACT G'G A-AT cdhl_ail.seq 

399 A TQG T G G CG G. G G T C A A C T G T T C A A C C A G T A G C T C A G T T T G C G A C T G.G A AT cjblli^ail.seq 
401 AT G G TCGCGCGCT C A ACTGTTGAACCAGT AGCTCAGTTT GCGACTGGAAT. nem316_all.seq 

400 A T||G T G G CG G G G T C .AAC T G TtG AACCAGT A GC TCAGTTT G C G A C T G G A A T a909_ail .seq. 
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2448 
[Alignment panels, positions 2451 to 2800: Majority consensus with rows 2603_ail.seq, 18rs21_ail.seq, coh1_ail.seq, cjb111_ail.seq, nem316_ail.seq and a909_ail.seq. The sequence rows were scattered column by column in extraction and are not reproduced here.]
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FIGURE 18 G 




Thursday, July 29, 2004 5:46 PM

C ^AAAy g^MSBj Btti gllBSS fed felk GAAGATTTAAAGAATTCACCT Majority ~ 



2810 2820 2830 2840 2850

2798 CAAAAACTAATGTGAGATACTTGTATGTAGAAGATTTAAAGAATTCACCT 2603_ail.seq
2567 CAAAAAGTAATCTGACATACTTCTATGTACAAGATTTAAAGAATTCACCT 18rs21_ail.seq
2799 CAAAAACTAATGTGAGATACTTGTATGTACAAGATTTAAAGAATTCACCT coh1_ail.seq
2799 CAAAAAGTAATGTGAGATACTTGTATGTAGAAGATTTAAAGAATTCACCT cjb111_ail.seq
2801 CAAAAAGTAATGTGAGATACTTGTATGTAGAAGATTTAAAGAATTCACCT nem316_ail.seq
2800 CAAAAAGTAATGTGAGATACTTGTATGT A X *G AAGATTTAAAGAATTCACCT a909_ail.seq

TCAAACATTACCAAAGCTTATGCTCTACCGTTTGTGTTGGAATTACCAGT Majority

2860 2870 2880 2890 2900

2848 TCAAACATTACCAAAGCTTATCCTGTACCGTTTGTGTTGGAATTACCAGT 2603_ail.seq
2617 TCAAACATTACCAAA.GCTTATCCTGTACCGTTTGTGTTGGAATTACCAGT 18rs21_ail.seq
2849 TCAA ACATTACC AAAGCTTATGCTGTAC C-G TTTGTGTTGGAATTACCAGT coh1_ail.seq
2849 TCAAACATTACCAAAGCTTATGCTGTACCGTTTGTGTTCGAATTACCAGT cjb111_ail.seq
2851 TCAAACATTACCAAAGCTTATGCTGTACCGTTT.GTGTTG.GAATTACCAGT nem316_ail.seq
2850 TCAAACATTACCAAAGCTTAT G C TGTACCGTTTGTGTTGGAATTACCAGT a909_ail.seq

TCCTAA .CTCTACACGTACAGGTTTCCTTTCT G A AATTAATATTTACCCTA Majority

2910 2920 2930 2940 2950

2898 TGCTAACTCTACAGGTACAGGTTTCCTTTC T.G AAATTAATATTTACCCT A 2603_ail.seq
2667 TGCTAACTCTACAGGTACAGGTTTCCTTTCTGA A A TTAATATTTACCCTA 18rs21_ail.seq
2899 TGCTAACTCTACAGGTACAGGTTTCCTTTCTGAAATTAATATTTACCCTA coh1_ail.seq
2899 TCCTAAC TX TACA. CGTACAGGTTTCCTTTCTGAAATTAATATTTA C-C C T A cjb111_ail.seq
2901 TCCTAACTCTACA G G.T A C A G'C.TTTC CTTTCTGAA ATT AATATTTACCCTA nem316_ail.seq
2900 TGCTAACTCTAC AG G T A CAGGTTTCC TTTCT-GA AATTAATATTTACCCTA a909_ail.seq

AAAAC CTTGTAACTCATGAACCAAAAACACATAAACATCTTAAAAAATTA Majority

2960 2970 2980 2990 3000

2948 AAAACGTTGTAACTGATCAACCAAAAA C A C A I A A A G ATGTT A A A A A A TT A 2603_ail.seq
2717 AAAACCTTGTAACTGATGAACCAAAAACAG. ATAAAGATGTTAA A|HU AfTA 18rs21_ail.seq
2949 AAAACGTTGTAACTGATGAACCAAAAACAGATAAAGATGTTAAAAAATTA coh1_ail.seq
2949 A A A AC G T T G T- A A C T G A TG A A C C A A A A A C A GAT AA AC ATGTT A A AAA ATT A cjb111_ail.seq
2951 AAAACGTTGTAA CTGATGAACCAAAA ACAGATAA.AGATGTTAAAAAATTA nem316_ail.seq
2950 A A A A X G TT GTAACTGATGAACCAAAAACAGATAA AG AT CTT A A A A A A TT. A a909_ail.seq

CCTC ACCACCA T CCAGGTTATACCATTGCTCAAGAATTCAA ATGGTTCTT Majority

3010 3020 3030 3040 3050

2998 GGTC-AGGACGATGCAGGTTATACGATTGGTCAAGAATTCAAATGGTTCTT 2603_ail.seq
2767 GGTCAGGACGATGCAGGTTATACGATTGGTGAAGAATTCAAATGGTTCTT 18rs21_ail.seq
2999 GGTCAGGACGATGCAGGTTATACGATTGGTGAAGAATTCAAATGGfTCTT coh1_ail.seq
2999 GGTCAGGACCATGCAGGTTAT ACGATTGGTGAAGAATTCAAATGGTTCTT cjb111_ail.seq
3001 CGTCA.GGACGATGCAGGTTATACGATTGGTGAACAATTC A A ATGGTTCTT nem316_ail.seq
3000 GGTCAGGACGATGCAGGTTATACCATTGGTGAAGAATTCAAATCGTTCTT a909_ail.seq

G A A A TC T A C A A T C C C T G C C A AT T T A G G T G A C T A T G A A A A A T T T G A A A T T A Majority 

3060 3070" "3080 3090 3100 

G A A A T C T A C A A T C f C C T G C C A A T T T A G.G T G A C T A T G A A A A A T T T G A A A T T A 2603_ail.seq 

G A A A T CI AG A A T C C C TG CCA ATT T;A G G T G ACT A T G A A A A A T T T G A A A T t A 18rs2liaU-seq 

G A A A TC T A C A A f CCX T G C C A A T TT A -G GTCACTAT G.A A A A A T T'TG A A AT T A dohl-ail.seq 

G A A AT C TACA At C CCT G G C A A T T T A. C : G T G A CT AT G A AAA A T T.T G A 'A A T T A cibill-alt.seq 

tei . ;G - A, A AT C T A C A A T\CX C t-G.C C. A A-T f T A G C T ;G.7a C "T A T G A A. A> A X T T T G A <K A T T A ne^l6^il.sm 

350 G A A AT C TAG A ATX C C" T G C C A A T -T ;T : A: G-. C T G AX T>A T .G " A, A AAA TT TCAA A T t A iadO^*^"' 

* C T G A T A A A T T T G C A G A T G G C T t G A C;T T A T A A AT C TvG T T G : C A A A A A T C, A A G Majority 



048 
ftl7 
349 
349 



398 

367 

399. 

399 

101 

LOO 



3H0 3120. 3130 3140 3150 



CTX AT A A ATTTG C A-G AT GGCTTGACTTATAAATXT CTT GG A AAA A TCAA G~ 2603^ail.seq 
CTGATAAATTTG C A G.ATGG.CTTGACTTATAAATCTGTTC G.AA AAATCAAG 18rs2Uall.seq 
C T G A T A A A T T T G C A G AT G G C T.T G AC t T AT A A AT C T G T TG G A A A A A T C.A A G cohll.ail.seq. 
CTXATAAATTTGCAGATGGCTTGACTTATAAATCTGTTGGAAAAATCAAG ciblll^ail seq 
CTGATAAATTTGCAGATGGCTTGACTTATAAATCTGTTGGAAAAATCAAG nem316_ail.seq 
C.Tfi A T A A AT T T C G A G AT-C G CTT G ACTT A T A A AT C T C t TG C A A A A A T C A A C a909„ail.seq 
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148 
917 
149 
149 
151 
150 



198 
967 
199 
199 
201 
200 



248 
H7 
249 
i49 
251 



£98 
>67 
»9 
39 

tot 
too 




A T T C G T T Cp p T A.tM|« l^'M'-A, T^iCTft j ^ lt ft. fc C ACTACACTATTGATGAACC Mai oTI^ 

31 t 60 3170 3180 3190 32O0 

!IH^ TCGAAAiCACTGAATACAGATGAGCACTACACTATT CATCAACC 2603_ail seq 

^^^^r^^^MM^ cTGAATAGAGATcAccAcTAcAcTATTcATcAAcc 

A II5 5 TTCGAAAACACTGAATAGAGATGA < ; UCTACACTATTGATCAACC cohl all seq 
A II 5 TTCCAAAACACTCAATAGAGATGAC CACTACACTATTGATGAACC cjblll aiHeq 
ATTC GTTCGAAAACACTGAATAGAGA T G A C C A CT A C A CT AT T G A TG A A C C nemSielail seq 
ATTGGTTCGAAAACACTGAATAGAGATGAGCACTACACTATTGATGAACC a909_aiKseq 

AACAG TTCATAACCAAAATACATTAAAAATTACCTTTAAACCAGAGAAAT Majority 
32 . 10 3220 3230 3240 3250 

AACAGTTCATAACCAAAATACATTAAAAATTACGTTTAAACCACACAAAT 2603_ail.seq 

AA C A GTTCAT,AACCAAAATACATTAAAAATTACGTTTAAACCAGAG AAA T 18rs2i_ail!seq 

AAG !5 T ! GATAACCAAAATACATTAAAAATTAC G"TAAACCAGAGAAAT cohl_ail.seq 

AACAGTTCATAACCAAAATACATTAAAAATTACGTTTAAACCAGAGAAAT cjblll aillseq 

AACACTTGATAACCAAAATACATTAAAAATTACGTTTAAACCACACAAAT ne«3l6"ail .seq 

AACAGTTGATAA CCAAAATACATTAAAAATTACGTTTAAACC AG A G A A A T a909_aU.seq 

TTAAAGAAATTGCTGAGC TACTTA A A G G A A TGACC CTTGTTAAAAATC AA Majority 
3 ^ 6Q 3270 3280 3290 33O0 

TTAAAGAAATTGCTGAGCTACTTAAAGGAATGAC.CCTTGTTAAAAATCAA 2603_aii seq 
TTAAAGAAATTGCTGAGCTACtTAAAGGAATGACCCTfCTTAAAAATCAA 18rs2I ail!seq 
TTAAACAAATTCCTGAGCTACTTAAAGGAATGACCCTTGTTAAAAATCAAcohlaTi seq 
TTAAAG.AAATTGCTGACCTACTTAAAGGAATGACCCTTGTTAAAAATCAA cjblli_all seq 
TTAAAGAAATTGCTGAGCTACTTAAAGGAATGACCCTTGTTAAAAATCAA ne*316_ail.seq . - 
TTAAAGAAATTGCTGAGCTAGTTAAAGGAATGACCCTTGTTAAAAATCAA a909_aii.seq 

GATGCTCTTGATAAACCTACTGCA. AATACAGA T G A T CCCCCATTTTTQCA Majority 
m 3310 ; 3320 3330 3340 3350 

GATGCTCTTGATAAA.CCTA'CTGCAAATACAGATGATGCGGCATTTTTGG'A 2603_ail seq 
GATGCTCTT-GATAAAGCTAG.TGCAAATACA. GATGATGCGGCATTTTTGGA 18rs21 ail seq 
G A T G CTCTTGAT AAAGCTACTGC AAATACA GATG ATGCGGC ATTTTTGGA cohl ail sea 
GATGGTCTT G A T AAACCT ACTG C.A A A T A C A GAT C A T. G C G G C A T T- T T T G G A - cjblll ailTseq 
.GATGCTCTTGA.TAAAGCTACTG€AAATACAGA.TGATGCGGCATTTTTGGA nem316~ail.seq 
GATGCTCTTC ATA AAGCT A CTGCAAATAC A CATG ATGCGGC ATTTTTGGA a909_aii.seq 

AATT CCAC TT GC ATC, AACTATTAATG A. A A A A G C A G T TTT A C G A A A A G C A A. Majority 
3360. 3370 , 338Q 3390 3400 

^ AA I T S CAGTtGCATCAACTAtTAATGAAAAACCAGT ail.seq 
aI **^I^^f° TTGCATCAACTATTAA " rGAAA AAGCAGXTXXA 0.G AAAAGCAA I8rs21 ail^eq 

H AA T™CAGTTGCATCAACTATTAATGAAAAAGCAGTTTTACGAAAAGCAAcohl ail.seq 

A J T CCAGTTGCATCAACTAT.TAATGAAAAAGGACTTTTACGAAAAGCA.A cjblll all!se q 

tk ! II^! GTTGCATCAAGTATTAATGAAAAAGCAG "TTA.GCAAAA.GCAA ne*3l6~ai 1 .se^ 

50 AATTCCAGTTGCATCAACTA.TTAATGAAAAAGCAGTTTTAGGAAAAGCAA a909 all. seq 
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,T TG A A A A T AC T T TT G A A CT KA ATATC ACC ATACTCCTC ATA A AC.CTC AC Majority 

3410 3420 3430 3440 3450



II I J- G A A A A T A CTTTTG AA C TT C A A T A T G A C,C A T A C T C C T G At A A A G CTG A C 2603 ail.seq 

tl ZZ5 A A A A T A C T TT * G A A 6 T T:C.A A TAf G A CCA J AC TC C T G A T A A A G CT G AC :i8rs5l aiHeq 

2 I I ® A A A A T A C T T TT G A A C;T T,C" A-A'T;A-.T„ G A C C- A T A C T C.C T GAT A *A A- G C T : fc A C cohl: ai^seq 

99 T T G'A A A A T A C T-T * Ii " r r - * a. r» .«r t a,, a -r a ******* ~ ~ ~ ~ * . . . -~ . _ * 

01 t T G A. A. A A TACT T 

0O T T G A A A A T A C T T T 

A AT CCA A AA CC ATC T A^A T C-C TIC C A A G A'.'A A AC C.A G A AG TT C. A T A C TIC'G. T G G Maioritxr 
■ 34 ,6P.- 3470 .3480 3490 .. 3500 

48 A A-T CCAAAACCATG T A A*T OCT C C A'AG A A A A C C A G A.A GTTC AT ACTG CTG G 2603 ail,seq 
17 A A t C C A A A A C C A T C T A A T C C T.C. C A A G A< A AA C C A G A A G T T C A T A G t G G T G G 18rsZI_ailleq 

49 AATCCAAAACCATCTAATCC T C C A AG A A A AC C A G A -AG TT C ATA C T C G TGG cohl ail.seq 

49 AATCCAAAA C C A T C T A A TC C T C CA A C A AAACCAG AAGTTCATACTGGTGG cjbill^allseq 
51 A A T C C A A A A C C A T C T A A T C C T C C A A G A .A A A C C A G A A G T T C A T A C T G G T G G nem316 ail seq 

50 AATCCAAAACCAX CTAATC'CTCCAAGAAAACCAGA AG T TC A T AC T G GT G G a909 all seq 



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Jfignmen, Report ofAH WO JOOe/OTMlS^Xw^^woigM^,/ ~ PCT/US2005/027239 


g a a a c c AltrMaj: jjU PSlJ gEg « ^CT aaacacaaacactacgtggtg Maiontv 



3510 3520 3530 3540 3550 




1548 
1317 
1549 
1549 
1551 
[550 



1498 GAAACGATTTCTAAACAAAGACTCAACAGAAACACAAACACTAGGTGGTC 2603_ail seq 

1267 CAAACCATTTGTAAAGAAAGACTCA AC AG AAACACAAACACTAGGTCGTC 18rs21 ailseq 

499 GAAACCATTTGTAAAGAAAGACTCAACAGAAACACAAACACTAGGTGGTG cohl.ail seq 

U99 GAAACGATTTGTAAAGAAAGACTCAACAGAA AC ACAAACACTAGGTGGT G cjbill ailseq 

JSOl CAAACGATTTGT. AAAGAAAGA.CTCAACAGAAACACAAACACTAGGTGGTG ne*i3l6 ail seq 

1500 CAAACGATTTGTAAACAAAGACTCAACACAAACACAAACACTACGTCGTG a909_ail. seq 

CT>GACTTTCATTTCTTCCCTTCTGATCGCACAGCACTAAAATCCACAGAT Majority 
3 5 60 3570 3580 3590 3600 

C T.G A C T T T G A T T T G T T G G CT T C T G A T G.G GACAGCAGTAAAATGCACAGA'T 2603 ail seq 

CTGAGTTTGATTTGTTGGCTTCTGATGGGACAGCAGTAAAATGCA'CAGAT 18rs21 all seq 

CTGAGTTTGATTTGTT GG CTTCTGATGGGAC AG CAGTAAAATGGACAGAT cohi_ail seq 

CTGAGTTTGATTTCTTCGCTTCTCATGGGACAGCAGTAAAATGGACACAT cjbill ailseq 

CTGAGTTTGATTTGTTGCCTTCTGATGGGACAGCAGTAAAATGGACAGAT nem316~ail seq 

CTGAGTTTGATTTGTTG.CCTTCTGATGGGACAGCAGTAAAATG G A CAGAT a909_aiI. S eq 

GCTCTTATTAA AGCGAATACTAATAAAAACTATAT T G C TGGAGAAGCT G T Majority 


36 , 10 3620 3630 3640 3650 

CCTCTTATTAAAGCGAATACTAATAAAAACTATATTGCTGGAGAACCTGT 2603_.ail.seq 
GCTCTTATTAAAGCGAATACTAATAAAAAC.TATATTGCTGGAGAAGCTGT i8rs2I_all seq 
GCTCTTATTAAAGCGAATACTAATAAAAACTATATTGCTGGAGAAGCTGT cohl_ail seq 
GCTCTTATTAAAGCGAATACTAATAAAAACTATATTGCTGGAGAAGCTGT cjbUl all seq 
GCTCTTATTAAACCGA AT ACTAATAAAA A'G TATATTG.CTGGAGAAGCTGT nem3t6_all seq 
GCTCTTATTAAAGCGAATACTA AT AAAAACTATATTGCTGGAGAAGCTG. T a909__ail.seq 

T A.C T'G C C C A A C C A A T C A A A T T.G A A A T C A'C A T A C A G A C C C T A C.C T T T G A G A Major i ty 

3660 3670 3680 3690 3700 

T A C T G C G € A A C C A A T C A A A T T G A A A T C A G A T A C A G A C G G T A C G T.T TCAGA 2603_ail seq , 
417 T A C T G G G C A A C C A A T C A A A T T G A A A T X A C A T A.C AGACCGTACGTTTGAGA _8rs21_auleq 
649 T A C T G G G C A A C C A A T C A A A t T.G A A A T.C A C A T A C A G A C G GT A C C T T T G A G A cohl_all.seq 
649 T AG T G G G C A A C C A A T C A A A T T G A A A T C A C A T A C A G A C 0 G T A C G T T T G A G A cjbll Call seq 
TACTGGGCAACCAATCAAATTGAAATCACATACAGACCGTACGTTTGA G A nem316_ail seq 
TACTGGGCAA.CCAATCAAATTGAAATC.ACATACAGACCGTACGTTTGAGA a909_all.seq 

TTAAAGGTTTGGCTTAT GCAGTTGATCCGAATCCAGAGCCf ACAGCAGTA Majority 

3710 3720 3730 3740 3750



598 
367 
599 
599 
601 
6G0 



648 



651 

6SO 



B98 tTAAAGGTTTGGCTTATGCAGTTGATGCCAATGCAGAGGGTACAGCAGTA 2603_all seq 

467 TTAAA.CGTTTGGCTTATGCAGTTGA'tGC G. A ATGCAGAGGGTACAGCAGTA 18rs21_aiKseq 

S99 TTAAAGGTTTGGCTTATGCAGTTGATGCGAATGCACAGGGTACAGCAGTA cohl_ail.seq 

S99 TTAAAGGTTTGGCTTATGCAGTTGATGCGAATGCACAGCGTACAGCAGTA cjbUl ali^seq 

7QI TTAAAG GTTTGG CTTATGCAGTTG' ATGCGAATGCACA. CGGTACAG. CAG.TA nem3l6_ai I .seq 

700 TTA'AAGGTTTG'GCTTATGCA.GTTGATGCGAATGCAGAGGGTACAGGAGTA a909_ail.seq 

A C T T A C A A A T T A A A A G A A A C A A A A G C A C C A G A A G G T T A T G T A A T C C G T G A Maj or i ty 

'3760 3770 ■ 3780 3790 3800 

ACT T A C.A AATTAAAAGAAAGAAA.AGCAeCAGAAGGTTATGTAAT C C C T G%A- 2603 ail, seq 



rsi 

rso 



S67 

rag 

T99 



JAS 

m A.C XT A.fJ A A AT T A AAA G A A A G A A A A G C A C CA G A A G G T T X T Q T A' AT fe C G TG A Ifi^llaiOeq 
T4S A C T T A C A A A T T A A A: A G;A A A C A A A A G C A G.G A G A A G G T T A T G X A AAT. G C C T G A cdrijall.seq v 
T49 A CT T AC A AATTAA-.A.A G A A A C A A A A. G GAG ,C A G A A-.G.G T T A T G t A A?T«C'C G* T : G A" ij t* _ii__a ii , seq 
m * A g;t ' T " A - c A TT^*M;G^iA ; € A A A A -fB-d A fe G. A AGG *T.Avt ^I^j^^Wl^^^. 
A C;t T;.A,C AAATT A^A A: A OA. A A € A Ar A' ;A/'G "G ! A ,;C"C A C-A^^ Gf f A #0^^^^Gi^'&_# 

T A A.A ^ A A A T C C A CT t T A C A G T X.T G A G A A A C A T C t.T 'A T. Lk^lk^^Lt;^^ M^jdrity^ 

3810 3820 ' 3830 . 3840 3850 

r98 T A.A AGAAAtCCA G T.T T A O A G T AT C A C A A A C A.T C TTATA AT AC A A A AC CA A 2603_:ail.seq 
c T A A A G A A A T C G A.G T T T A C A G T A T C A C A' A A C A T C T T A T A A T A C A A A A- C C A A 18rs21,_aH .seq 
T A A A G A A A T C C A G T T T A C A G t A T C AX A A A C A T C T.T A T A A T A C A A A A C C A A cohU^H.seq 
T A:A A G A A AT C G A GT T T A C A G T A T C A C A A A C A T CT TAT A A T A C A A A A C'C A A- , cjblH jail Jseq 
J01 TAAAG A A ATCCA G T TTAC A.QTATC ACAAACATCTTAT AAT.ACA AA ACC AA neo316 ail ,seq 
JOO T A:A A G A A A T C G A G T T T A C A G T.A T C A C A A A C A T C T T A T A A T A C A A A A C C A A a9b9__ail,seq : 
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3848 
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^I^ A I A ^ AAAAAACCACT AGCAGGCGTTGTCTTTGAACTtTATGAAAA 18rs21 ail!seq 

toll ^I^ A I^^ AAAAAA ^ ACTAGCAGG CGTTGTCTTTGAACTTTATGAAAA cohl_aTl .seq 

GG Z! GATACCAAAAAA CCACTAGCAGGCGTTGTCTTTGAACTTTATGAAAA cjbtll aiiseq 

^ ^ J G A T A C C A A A A A A C C A C T A G C A € G C C T T G T C T T T G A A C T T T A T G A A A A ne*316_ail ^eq 

1600 GGTGATACCAAAAAABcACTAGCAGGCCTTGTCTTTGAACTTTATCAAAA a909__ail.seq 

G A .A T G G T A G G ACTCC T A T T CGT C TC A A A A AT GG C C T C CATTCTCAAG ATA Majority 
46 t 6Q 4670 4680 4690 4700 

1648. GAATGGTACCACTCCTATTCGTCTGAAAAAtGGGGTGCATTCTCAAGA T A 2603 ail/seq 

"SI GAA TGGTACGACTCCTATTCGTGtGAAAAATGGGGTGCATTC T C A A G A T A 18rsil„aii?seq 

£S I GCTACGACTC CTATTCGTGTCAAAAATCGGGTG'CATTCTCAAGATA cohl ail. seq 

649 Q A A T G G T A G G A C.T C C T A T T C G T G T G A A A A A T G G G G T G C A T T C T C A A G A T A cjblTi airseq 

I65i C A A T G G TAG G A C T C C T A T T C G T G .T G A AAA A T CG G GT.G CATTCTCAAG ATA ne«316~ail seq 

1650 GAATGGTAGGA C T CCTATTCGTGTGAAAAATGGGGTGCATT. CTCAAGATA a909_aTl.seq 


TTGACGCTGCAAAACAT TTAGAAACAGATTCATCACGGCATATCAGAATT Majority 
' A7 }° 4720 4730 4740 4750 

J698 TTGACGCTGCAAAACATTT'AGAA'ACAGATTCAT C A GGGCATATCAGAATT 2603 ail seq 

U67 TTGACGCTGCAAAACATTTAGAAACAGAT T C A TCAGGGCATATCAGAATT 18rs21 ail sea 

^ T T G A C G C T G C A A A A C A T T T A G A A A C'A G A T T C A T C A G G G C A T A T C A G A A T T cohl all. seq 

699 TTGACGCTGCAAAACATTT AG A AACACATTCATCAGGGCATATCAGAATT cfblll ail^eq 

™ t t S ^ GGCTGC'AAAACATTTAG A A ACAGATTC A TCAGGGCATATCAGAATT nem316~ail ^ 

1700 TTGACGGT G C A A A A C A T T T A G A A A C A G A T T C A T C^AG G G C A T A T C A G A A T T a909_ail.seq 

TCCGGGCTCATCCATGGCG ACTATCTCTTAAAAGAAATGGAGACA'CACTC Majority 



1848 

1617 

1849. 

1849 

1851 

1850 



4760 4770 4780 4790 4800



TCCGGGCTCATCCATCGGGACTATGTCTTAAAACAAATCGAGACACAGTC 2603_ail.seq
TCCGGGCTCATCCATGGGGACTATGTCTTAAAAGAAATCGAGACACAGTC 18rs21_ail.seq
TCCGGGCTCATCCATGGGGACTATGTCTTAAAAGAAATCGAGACACAGTC cohl_ail.seq
TCCGGGCTCATCCATGGGGACTATGTCTTAAAAGAAATCGAGACACAGTC cjblll_ail.seq
TCCGGGCTCATCCATGGGGACTATGTCTTAAAACAAATCGAGACACAGTC nem316_ail.seq
TCCGGGCTCATCCATGCCGACTATGTCTTAAAAGAAATCCAGACACAGTC a909_ail.seq

ACGATATCAGATCCGACACGCACAGACTGCTGTCACTATTGAAAAATCAA Majority
4810 4820 4830 4840 4850

1798 AGGATATCAGATCGGAGAGGCAGAGACTGCTGTGACTATTGAAAAATCAA 2603_ail.seq
^ ft r ^ A T A T G A G A T C G*G' A C A..G G <5 A G A G A C T. C C T. G *T G A G T'. A T. I G A A A A A T^C A A 18rs21. aiiseq 
r. A :TM € A G A ;T G G G AC A GjGrC "A GAG ACTGCTGTG ACT AT t G A A -* A A T C A -A- cohl all^ed : 
A T "A T G A G A T C G G'AC A G C C A G * A :fc ..A. r It r. r t pt n a. /> *p a:* t, o'a.a » v ^ * * ~ . — t 






A;-€ ;€Wjfc ;f--A A-CV.A' T %-G !A-' Af. A' A -.T ''A:. A . A . A A AG TT C'C G AC AC CT A-X A G T G Maloritv 

4860 4870 4880 4890 4900

AAAGAGTAACAGTAACGATTCAAAATAAAAAAGTTCCGACACCTAAAGTG 2603_ail.seq
AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG 18rs21_ail.seq
AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG cohl_ail.seq
AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG cjblll_ail.seq
AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTGCCACACCTAAAGTG nem316_ail.seq
AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTGCGACACCTAAACTG a909_ail.seq
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ccatct rc'tecHs 'A <t l € rgfrTt tFe g c y r A"i r< 





Tit fTC € CYH A'C ACCTCKCAACACCCAATCGC Majority 



4910 4920 4930 4940 4950

4898 CCATCTCGACGACGTCTTATTCCCAAAACACGTCACCAACAGCCAATGGC 2603_ail.seq
4667 CCATCTCGAGCACCTCTTATTCCCAAAACAGGTCACCAACACCCAATGGC 18rs21_ail.seq
4899 CCATCTCGACGAGGTCTTATTCCCAAAACAGGTGAGCAACAGCCAATGCC cohl_ail.seq
4899 CCATCTCGAGGAGGTCTTATTCCCAAAACAGCTGAGCAACAGCCAATGGC cjblll_ail.seq
4901 CCATCTCGAGGAGGTCTTATTCCCAAAACAGGTGAGCAACAGGCAATGGC nem316_ail.seq
4900 CCATCTCGACCAGCTCTTATTCCCAAAACACGTGAGCAACAGGCAATCGC a909_ail.seq

ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT Majority 
4960 4970 4980 4990 5000

4948 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT 2603_ail.seq
4717 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT 18rs21_ail.seq
4949 ACTTGTAATTATTCGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT cohl_ail.seq
4949 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT cjblll_ail.seq
4951 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT nem316_ail.seq
4950 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT a909_ail.seq

CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGCACAAAAATCAAA Majority
5010 5020 5030 5040 5050

CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGGACAAAAATCAAA 2603_ail.seq
CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGGACAAAAATCAAA 18rs21_ail.seq
CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGCACAAAAATCAAA cohl_ail.seq
CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGGACAAAAATCAAA cjblll_ail.seq
CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGCACAAAAATCAAA nem316_ail.seq
CAAAACATCCGAAACATCAAAATAAGGATTAGCATGGGACAAAAATCAAA a909_ail.seq

AATATCTCTAGCTACGAATATTCGTATATGCATTTTTCGTTTAATTTTCT Majority
5060 5070 5080 5090 5100

A A TA'TCTCT AGCTACG A AT ATT CGTAT ATGG AT TT.TTCG TT TAATT-TTCT 2603_ail seq 

A A X A T C.T CT. AG GTACGAATATTCGTATATGGATTTTTCGTTTAA-T TTTCT 18rs21_ail seq 

5049 A ATATCTCT AC CT AC G A ATATTCG.T AT ATGG A T T T T T C G TTTAATTTTCT cohl_ail,seq 

5049 AATATCTCTAGC TAC GAATATTCG.TATATGGATTTTT C G.TT T A AT TTTCT cjblll_all seq 

AATATCTCTAGCTACGAATATTCGTATATGGATTTTTCGTTTAATTTTCT ne*316_ail seq 

AATATCTCTAGCTAC G A AT ATT CCt ATATGCA T TT TTCGTTTA ATTTTCT a909_ail.seq 

TAGCCCGTTTCCTTCTTTTGGCATTTCCCATCCTTACTCACCTCATCTAC Majority
5110 5120 5130 5140 5150

TAGCGGGTTT C C.T T G TTTTGC.CATTTCC C AT CGTTAGTCAGGTCA T-G TAC 2603_ail.seq 
TAGCGGGTTTCCTTGTTTTGGCATTTCCCATCGTTAGTCAGGTCATGTAC 18rs21_ail seq 
TAGCGGGTTTCCT.TGTTTTGGCATTTCCCATCGTTAGTCAGGTCATGTAC cohl.ail.seq 
TAGCGGGTTTCCTTGTTTTGGCATTTCCCATCGTTAGTCACG T C A T G T A C cjbill^ail seq 
TAGCGGGTTTCCTTGTTTTGGCATTT.CCCATCGTTACTCA.CGTCATGTAC nem316_ail seq 
TAGCGGGTTTCCTTGTTTTG CCA TTTCCC AT.CGTTAGTCAGGTCATGT AC a909__aU.seq 

T T T C A A G CC TC T-CA C C C C A AT ATT A A T'C C TT TT A A'-A C A AC C T CTT.A C C A A Majority 
5160 5170 5180 5190 5200

i!48 T.TT C A A.GsC CTCT CAC G 'Q C A A T A T T A A T G C T T T T A A A G A A G C T G T T A C C A A 2603_all seq 

917 T TT C A AG CC :T; C T C A C G C C A AT ATT-A ATGCTTT T A A A G A A CC T GT.T A C C A 5 A 18rs2X. ail^eq 

149 TTTCAAGCCT C.T C AC GC C A A T AT TAATGCTTT f AAA G A CT G TT AC G-A A cohliiu sea' 

149 T T .T. C A A G C C T C.T CjA .C;.G,C C A A T A T T A A T G '€"% T T T A A1 G 11^ A G jG T ^G - THf 'A C X A. A 'cjoiilLai i v seq 

151 t T^r c a AG c e/*; ! $ ax.^ : 

l$Q T TT C A AG CG T;C' T ; G* A/C G> C C{ A AT ATT A- A T G C T T T t A 'A '-Tgj& A A^ £ *'Bi^'C'C ; fi '^11^ 

GAT T C AC C C G ,CiT G G A G A T t A A:T C G G C G T T T. A Q A A C >T^. T. G} C. X • % : A: T/G , C T -T • A' . T A Ma j'ority 

5210 5220 5230 5240 5250

198 G A.=T f G A CCCG.CTGGAGATTAATCCGCGTTTAGA ACTTG CT T A TG C TT A T. A 2603_ail seq 
967 G A T T G A C C G G G T G G A G A T T A A T C CC C G T T T A G A A C T T G C T T A T G C T f A T A 18rs2l ail seq 

199 G A T T G A C C G G G T G G A G A T T A A T C G G C'O T T T A G A A C T T G C T T A T G C T T A T A cohCaTiiseq 

199 GATTGACCCGGTGGAGATTAATCGGCGTTTAGAACTTGCTTATGCTTATA cjblll_ail seq 
201 G A T T G A C C G G G T G G A C A T T A A T C G G C G T T T A G A A C T T G C T T A T G C T T A T A neo316^aU.seq 

200 G AT T G A C C G G G T G G A G A T T A A T C G G C G T T T A G A AC T t G C T T A T G C T T A T A a909„ail seq 
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~ ' —I — t . 

PCT/US2005/027239
GGCCA ATATCCACCGCTTAAA Majority 
5280 5290 5300 






ACCCCAfTATArrirrTrrriAi^ I8rs21^ai l^eq 

TATAGCAGGTGCCAAAACTAATGG CCAATATCCACCGCTTAAA cibUI ait sea 

ACCCC^CTATArr^rrTr TBI CACCGCTTAAA ^3 6^ ^ 

ACCCCACTATAGCAGGTGCCAAAACTAATGGCCAATATCCACCGCTTAAA a909_aU. S eq 




[Alignment rows for positions 5301 to 5450 were split into single-base columns during extraction and cannot be reassembled reliably; only the partial rows below survive intact.]

CCCTCGCTCTGCTCAACAAAATCTTCAGAGG 2603_ail.seq
CGCTGGCTCTGCTGAAGAAAATCTTCAGAGG 18rs21_ail.seq
CGCTGGCTCTGCTGAAGAAAATCTTCAGAGG cohl_ail.seq
CGCTGGCTCTGCTGAAGAAAATCTTCAGAGG cjblll_ail.seq
CGCTGGCTCTGCTGAAGAAAATCTTCAGAGG nem316_ail.seq
CGCTGGCTCTGCTGAAGAAAATCTTCAGAGG a909_ail.seq



[Alignment rows for positions 5451 to 5550 were split into single-base columns during extraction and are not recoverable here.]



CCAATTTAGACAAGCTAACAGTAGGTGACCGTTTTTACATTGAACACATC Majority
5560 5570 5580 5590 5600

CCAATTTACACAAGGTAACACTAGGTGACCGTTTTTACATTGAACACATC 2603_ail.seq
CCAATTTAGACAAGGTAACAGTAGGTGACCGTTTTTACATTGAACACATC 18rs21_ail.seq
CCAATTTACACAAGGTAACAGTAGGTGACCGTTTTTACATTGAACACATC cohl_ail.seq
CCAATTTAGACAAGGTAACAGTAGGTGACCGTTTTTACATTGAACACATC cjblll_ail.seq
CCAATTTAGACAAGGTAACAGTAGGTGACCGTTTTTACATTGAACACATC nem316_ail.seq
CCAATTTAGACAAGGTAACAGTAGGTGACGGTTTTTACATTCAACACATC a909_ail.seq
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FIGURE 18 O 
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G C C G G A K W&A HfT^fc MffW <P^G jjggjg fflt' A U A ATCAAACTTATCGCCCCTCA Majority 
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648 



5610 5620 5630 5640 5650 



598 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA 2603_ail.seq
367 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA 18rs21_ail.seq
599 GCCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA cohl_ail.seq
599 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA cjblll_ail.seq
601 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA nem316_ail.seq
600 GGCGGAAACATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA a909_ail.seq

TCAGTTAGAGGATTTCTACGTGATTCAAGGAGAAGATCACGTCACCC T.A T Majority 
5660 5670 5680 5690 5700

TCAGTTAGAGGATTTGTACGTGATTCAACGAGAAGATCACGTCACCCTAT 2603_ail.seq 

TCAGTTAGAGGATTTGTACGTGATTCAAGGAG A. A GATCACGTCACCCTAT 18rs21_ai 1 .seq 

TCAGTTAGAGGATTTGTACGTGAT. TCAAGGAGAAGATCACGTCACCCTAT cohI_ail.seq 

TC.ACTTAGAC6ATTTCTACCTCATTCAAGGAGAAGATCACCT-CACCCTAT cjblll_ail.seq 

TCAGTTAGAGGATTTGTACCTGATT C A ACGAGAAGATCAC GT CACCCTAT nesi316_aU seq 

TCA. GTT AGA GGATTTGTACGTGA T.T CAACGAGAAGATCACGTCACCCTAT a909__ail .seq 

TAACT TGCACACCTTATATGATAAATAGTCATCGCCTCCTCG TTCGAGGC Majority 

5710 5720 5730 5740 5750

TAACTTCCACACCTTATATGATAAATAGTCATCGCqTCCT.ee TTCGAGGC 2603_ail seq 
TAACTTGCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC 18rs2l_ail .seq 
TAACTTCCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC cohl_ail.seq 
TAACTT GCACACCTTATATGATAAATAGTCATCG.CCT CCTCCTTCGAGG.C cJbUl„all.seq 
T A A C.T.T GCACACCTTATATCAT- AAATAG-T.CATCG'CCT-CCTC. GTTCGAGGC neia3l6_.aH .seq 
TAACTTGCACAC C.T TATATGATAAATAGTCATCGCCTCCTCGTTC GA GGC a909_ail.seq 

A A GCG AATTCCTTATGTGGAAAAAACAGTGCA.GAAAGATTCAAAGACCTT Majority 
5760 5770 5780 5790 5800

A A G C G A AT TCCTTATGTGGAAAAAACAGTGCAGAAAGATTC. AAAGACCTT 2603_aii.seq 
AAGCGAATTGCTTATGTGGAAAAAACAGT G C A G A A AG ATTCAA A.GAC CTT 18rs2l__ail .seq 
AAGCGAATTCCTTATGTGGAAAAAACAGTGC. AGA'AAGATTCAAAGACCTT cohi._ail.seq 
AAGCQ AATTG C.T TATGTCG AAAAAACACTCCAC AAA G ATTCAA AGACCTT cjblll_all.seq 
A A G C G A A T T C C T T A T. GTGGAAAAAACAGTGCAGA A. A GATTC AAAGACCTT nem3l6_ail .seq 
AAGCGAATTCCTTATGTGGAAAAAACAGTGCAGAAAGATTCA A AG AC CTT a909_.all.seq 

CAGGC A.ACAACAATACCTAACCTATGCTATCTGGGTACTCGTTGG A C T T A Majority 

5810 5820 5830 5840 5850

798 CACGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTGGACTTA 2603_ail.seq 
567 CAGGCAA'CAACAATACCTAACCTATGCTATGTGGGTA'GTCGTTGGACT T A 18rs21_ail seq 

799 CAGGCAACAACAATACCTAACCTATGCTATCTGGGTAGTCGTTGGACTTA cohl_ail seq 
799 CAGGCA ACAAC'AAT A CCT.AACCTATGCTATGTGGGTA GTCGTTGGACTTA cjblll_an.seq 

CAGCCAACAACAATACCTAACCTATGCTATGTGGGTA GTCGTTGGACTTA ne_316_ail.seq 
CAGGCAACAACAATACCfAACCTATGCTA.TGTGGGTAGTCGTTCGACtT A a909_a 11. seq 

TCTTGC TGTCGCTTC TCA T T T G G TT.f A A. A A AG A C C A A A C AGA A A A AG C G G Majority 

5860 5870 5880 5890 5900

*48 T CT T G C T G t:C.G CTTCtCAT TTGGTt T.A A AAA GA CG A A AC AG A A A A A G C G G 2603 ail. seq 

517 T-C T'T G C 'T-G " ^ ^ ^ - ~ * — ~ ~ - - - - • - — - - '- - - -' 

H9 T CTT G C T G 

149 T. C t T. G C T;G T 

i5i t c T'T O C T b;; 

150 T C.T T GX-T (C 

* -* » 

A G A A A G A. A T G A A A A A G C C G- C T. A.G T C A A A. A T ACT C A C A : A ;T, : A'- A j vT. C V G" -AV.- A'^A-'/T' & May or ity 




5910 5920 5930 5940 5950

A G- A A A G A A T G A A A A A G C G G C T A G j C A A A A T A G T C A C A A T A. A XT C C A A A T A -2603_ail.seq 
A GT A A.A.G AATGAAAAAGCGGCTA GTC AAA ATA GTCACA AT A AT.TCG'A A ATA lSrsZl^all.seq 
A G A A A G A A T G A A A A A G C G G C T A G-T C A A A A T A G T CA C A A t A A t T C G A A A T A. cohl_ail.seq 

C A CA AT AATTCGA A A T A cjbllCall..seq 
K)l A G A A A G A A T G A. A A A A G C G G C T A G T C A A A A T A G T C A C A A T A A T T C G A A A T A nea316_ail.seq 
*>0 AGA A A G A A T G A A A A A G C G G C T A G T C A A A A T A G T C A C A A T A A T T C G A A A T A a909 ail seq 
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5767 



WO 2006/078318 PCT/US2005/027239

Alignment Report, method with Weighted residue weight table
Thursday, July 29, 2004 5:46 PM, Page 18

A T A A A A ^M^YW ^^&^M^Y ^ &: ^ CGAAGTCTGATTCTCTTATTT Majority 
5960 5970 5980 5990 6000

ATAAAATCAGAACCCTCATTTTTGTGATGCCAAGTCTGATTCTCTTATTT 2603 ail sea 
5717 ATAAAATCAGAAC CC TC A T TTT T G T G ATG G C A A GTC T C A TTC T C TT AT T T 18rs21 ail sea 
llf a ATAAAATCAGAACCCTCA TTTTTGTGATGGCAAGTCTGATTCTCTTATTTcohlaTl seS 

5949 ATAAAATCAGAACCCTCATTTTTGTGATGCGAAGTCTGATTCTCTTATTT cjblll aillea 
Slo A I AAAA 1^ AGAA ^"CATTTTTGTGATGGGAAGTCTGATTCTCTTATTT nemSieZail ^ 

5950 ATAAAATCAGAACCCTCATTTTTGTGATGGGAAGTCTGATTCTCTTATTT a909_ai 1 .seq^ 

CCGATTGTGAGCCAGGTAA GTTACTACCTTGCTTCCCATCAAAATATTAA Majority 
6010 6020 6030 6040 6050

A TTGTG AGCCACGTA AGTTACTACCTTGCTTCGCATC AAAATATTAA ISrsi? aU^eq 

^5 A 3 T S GAGCCAGGTAAGTTACTACCTTGCTTCGCATCAAAA TATTAA cohl ail.seq 

CCCATTCTGAGCCAGGTAAGTTACTACCTTGCTTCGCA-TCAAAATATTAA cjblU all seq 

^r!^r^! GCCACCTAAGTTACTACCTTGCTTCGCATCAAAATA TTAA nem316""aii; S e^ 

CCGATTGTGACCCACGTAAGTTACTACCTTGCTTCGCATCAAAATATTAA a909_ali.seq 

TC.AATTTAAGCGGGAAG TCGCTAAGATTGATACTAATACGGTTGAACGAC Majority 
6060 6070 6080 6090 6100

TAACCGCCAAGTCGCTAAGATTGATA.CTAATACGCTTGAACGAC 2603_ail seq 
TCAATTTAAGCGCG A AGTCGCTAACATTGATACTAAT.ACGGTTG AACGAC 18rs21 aiiseq 
TCAATTTAAGCGCGA AGTCGC'TAAGATTG ATACTAATACGGTTG AACGAC. cohl all .seq 
TCAATTTAACCGGGAACTCGCTAAGATTGATACTAATACGCTTGAACGAC cjblll all seq 
TCAATTTAAGCGGGAACTCGCTAAGATTGATACTAAT AC G GTTG AACGAC ne^ie^tl'sea 
TCAATTTAAGCGCGA A GTCGCT A AGATTGATACTAATACGGTTG AACGAC a909_all^seq 

CCATCGCTTTACCTAATGCT TACAATGACACCTTATCAAGCAATCCCTTC Majority 

6110 6120 6130 6140 6150

G C A T C G C T T T A G C T A A T G C T T A C A A T G A G A C G T T A T C A A G C A A T C C C T T G 2603iail seq 

G CATCG C.TTTA GCTAATGCTTACAATGAGACG'TTATCAAG GA AT CCCTTG 18rs2i ail seq 

™* 5 S f I "5 S CTTTAGC.TAATGCTTACAATGAGACGTTATCAAGGAATCCC.TTG cohl ail.seq 

™* GCATCaCTTTAGCTAATCC.T.TACAATGAGACGTTATCAAGGAATCCCTTG cjblll ail seq 

!™ ?S A ^ GG CTTTAGC TA ATGCTT A C A ATGAGACGT TA TCA-AG'GAATCCCTTG *eii3i6~ail s2 

,100 G CA TCG C.TTTA G-C TAATGCTTACAATGAGACGTTATCAAGGAATCCCTTG a909_all.seq 

C T T A T. A GA.C CCTTTTACC AGTAA GCAAAAAGAAGC T"T TCACAGACTATCC. Majority 
6160 6170 6180 6190 6200

CTTATAGACCCTTTTACCAGTAAGCAAAAAGAAGGTTTGAGAGAGTATGC 2603 ail.seq 
CTTATACACCCTTTTACCAGTAAGCAAAAAGAAGGTTTGAGAGAGTATGC 18rs2i_ail seq 
^II ATACACCCTTTTACCACTAAGCAAAAA GAAGGTTTGAGAGACTATCC cohl ail.seq 
CTTATAGACCCTTTTACCAGTAAGCAAAAAGAAGGTTTGAGAGAGTATGC cjblll aiiseq 
CTTATA GA CCCTTTTACCAGTAAGCAAAAAGAA'GGTtTGA GAGA GTATGC ne*316~ail *seq 
CTTATAGACCCTTTTA CCA GTAACCAAAAAGA AGGTTTGA.GAGAGTATGC a909_ali.seq 

T CGTATCCTTC A AG TTC ATG A GCAAATAG GTCATGTGG CA AT CCC.AAGTA Majority 
6210 6220 6230 6240 6250

198 TCGTATGC T!TG A AG t TC ATG ' 
967 T C G T A T.G C TTG A A, G T T C A t G 

199 T. C G T A T G C T T G A A G. T T C A T G 

199 TCGTATG C T ?.G A A ?G TTC A T G 
mi T C^t A T* G ^i:G:^IC^ffi^ 

200 T C G T A T.C;fe^>^^^ 



TCGTATGC iTG A ACT TC ATG AG C A A AT A G G 7± C ATG.TGG CX AT C CCA ACT A 2603 ail-seq 
T CC T A T.G C T T G A A. G T T C A T G A G C A A A T A G it V C A T a ?r A n a. a Ti» r* a * £ * **, lr A _ 



6260 6270 6280 6290 6300

T TM;.G G G T T G AT AT TCCAA TTT AT G C.T G.G A AC A T CCG A A ACTC^G CTTC AG 2603 ail.seq 

I I G I GG TTGATA T T C C A AT TT A T G C T.G G A AC A T C.C G A;A A C TC tVC TTC AG I8rsii ail^eq 

3 I o ? S G T J G A T A T T C C A A T -T T A TCCTG C A X C A :T C C G A A A C TC t C C T TC A G cohl ail.seq- 

T T G G.G G T T G A T A T T C C A A T T T A T G C T G G A A C A T. C C G A A A C T G T G C t ' T C A G cjblll aiiseq 

?5* TTGGGGTTCATATTCCAATTTATG.CTCGAACATCCCAAACTCTGCTTCAG nem316~ali Seq 

250 TKGGGTT GAT A T T C C A A T T T A T G C T G G A A C A T C C G A A A C T G T G C T T C A G a909 all se^ 



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6298 
[Alignment rows for positions 6301 to 6650 were split into single-base columns during extraction and are not recoverable here.]
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£98 
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- ' 4"£lA„A A TjtgjA ^lcHg^ ^ Til&.PXgO»T kTCTTCAAGATACTACTAGTAT Majority 
6660 6670 6680 6690 6700

^ ^^GTACAACATTATCGTTTGTCACTACTGTTGAACATACTACTAGTAT 2603 ail seq 

£!I M^^ TACAAGATTATCGTTTGTCACTAGTGTTGA AGATACTACTAGTAT iSrsil ail seq 
^ J 5 TACAAC ATTATCGTTTGTCACTAGTGTTGAACATACTACTAGTAT cohl all.seq 

>649 AACCGTACAAGATTATCGTTTGTCACTAGTCTTCAAGATACTACTAGTAT cjblll ail seq 

,651 AACCGTACAACATTATCGTTTGTCACTAGTCTTCAAGAT AC TACTAGTAT nem316~al 1 seq 

,650 AACCGTACAAGATTATCGTTTGTCACTAGTGTTGAAGATACTACTAGTAT aSH^all.seq 

TATTAATTGGACTCTTCA TCCTGATAATGATGAGAAGATCGATGCAACAT Majority 
6710 6720 6730 6740 6750

TATTA ATTGGA CT CT TCATCGTCAT AATG ATGAGAAG ATGGATGCAACAT 2603 ail seq 
TATTAATTGGACTCTTCATCGTGATAATGATGAGAAGATGGATCCAACAT 18rs21 ailseq 
TATTAATTGCACTCTTCATCCTCATAATGATGAGAAGATCGATGCAACAT cohl all. seq 
TATTAATTGGACTCTTCATCGTGATAATGATGAGAAGATCGATGCAACAT cjblTl ail seq 
TAT T A ATTGGACTCTT C'A TCGTGATAATGATGAGAAGATGGATGCAACAT nem316 ail' seq 
TATTAATTGGACTCTTCATCGTGATAATGATG A.G AAGATGGATGCAACAT a909_aii .seq^ 

CCTCAATAACGATGTTGT GAATGGCTTACTTACTTATCAAATACGTGAC.T Majority 

6760 6770 6780 6790 6800

™ CGTCAATAACG ATGTTGTGAATGGCTTAl 1 flH 1 C A A A 1 AGGTGACT 2603_ail.seq 

517 CGTCA ATA ACG ATGTTGTG A ATGGCTTACTTACTTATCA AAT ACGTGA CT I8rs21 all seq 

749 CGTCAATAACGATGTTGTGAATGGCTTACTTACTTATCAAATAGGTGACT cohl all seq 

749 CGTCAATAACGATGT T G T GAATCCCTTAC.TT ACTTATCA AAT.AC.GTt ACT cjblTl ailseq 
751 CGT-C A ATA ACGATGTTGTGAATGGCTTACTTACTTATCA AAT AGGTGACT nem316~ai 1 'seq 

750 C G T C A A T A A C G A T GTT G T G A A T G G C TT A C TT ACTT A T C A A A T AG GTG A C T a909_ait.seq 

AATC.ATG ATTGTGAAT A AT GGT T A TCTAGAAGGCACAAAAATGAAAAAGA Majority 
6810 6820 6830 6840 6850

794 AATGATGATTGTGAATAATGGTTATCTAGAAGGGAGAAAAATGAAAAAGA 2603 ail seq ' 
567 A A T G A T C.A T T G T G A A T A A T G G T T A T C T A G A A G G C A C A A A A A T G A A A A A C A 18rs2i ailseq 
799 AATG A T G A TT G T G A A T A A T GG T T A T C T A G A A G G'C AG AAA AATGA AAA A G A cohl all seq 
7?9 ..A AT C.ATGATTGTGAATAATGGTTA TCTAGAAGGCACAAAAATGAAAAAGA cjblTl ailseq 
BOl A.AT. CATGATT CT G A A T A A T GGTT AT.CTA.G A.AGGG A G A A A A A T G A A A A A C A nemS^aiKseq 
SOO AAT G AT CATTGTCAATAATGGTT A TCTAGAAGGCACAAAAATGAAAAAGA a909„aM.seq 

C A C A A A A A A T A T G C A C A G G G T T A T C A G TT A C TT T ACTA A TCCTGTCCCAA Maj or I ty 
6860 6870 6880 6890 6900

344 GAG A AAAAATATGGAGAGGGTTATCACTTACTTTACTAA TCCTGTCCCAA 2603_aii seq 

317 G A G A A A A A A T A T Q G A G A G G G T T A T C A G T T A C T T T A C T A A T C C T G T C C C A A i8rs21_ailseq 

J49 GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA cohl ail seq 

349 GACAAAAAATATGGAG. AGGGTTATCAGT'TACTTTACTAATCC T GTG C C A A cjblTllailseq 
551 GACAAAAAATATGGAGAG. GGTTATCAGTTAC TT T A CT AATCCTGTCCCAA nem316_aU *seq 

350 GACAAAAAATATGGAG AG G.GTTATCAGTTACTTTACTAATCCTGTCCCAA a909_ail.seq 

ATT C C ATTTGGT AT AT T GG T A.C A A CCTCAA AC C C A A GA TACCAATCAAGC Majority 

6910 6920 6930 6940 6950

i94 ATT (JC ATTT.CGT ATA TT GfTAGA ACG T G A A A C C C A AG A T A C C A A T C A A G C 2603 all.seq 
!?I A 5 J I G ? T A pATTC CT A C A A G:C T S G AiA Ajc C C A A G ^ T A C C A A : T C A A C C '■ 18rs2Uaii!seq 
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t99 

$99 A 



#T^-,C,.C,A TJT T G G T.A T.T C G T A;'C A,A G G T G A A A ;C f C C A A G A T A C C A A T C A A G C cofcl all.seq 



A G;T T^G;G A A^AvA'C T A AT T GT T A A- A :.Ai Ak A A CG G:C A C'A? C A A . T ' G C- T : A C . A C ■ C A T- T K G Ma I tirii.y- 

6960 6970 6980 6990 7000

144 ' A CTTC C A A A ACTA A T T G" T* T A A- AAA AACGC GAGA C AATG C T A C A G C A T T AG 2603 ail.seq 

ri7. : A'CIT G G A A A A G T A A T T G T T A A AAA A ACGGGACA C A A T G C T A CACC At T A C 18rs21 ail!seq 

W9 AC T T G C A A A ACT AAT T„G T T AAA AAA ACCGG A G AC A AT G C T A CACC AT TAG cohl all.seq 

»49 ACTTGCAAA AG TAATTC.TTAAAAA AACGGGACAC AATGCT AC A C C A T T AC cjblll ailseq 

)Sl A CT T C C A A A A G T A A T T C T J A A A A A A A C G G G A G AC AATG C T AC ACCATTAC nem316 ailseq 

>50 AG T T G C A A A AG T A A T TGT T A AAA AAACCG GQ G A CMTGCTACACCATTAG a909 all.seq 
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G C A A A G CffciUc T f TV^G"^ fe'TT A A k'"A A l"Y — ~— 




, GACA ATGA T A AGTCAGAAACAAGT Majority 

7010 7020 7030 7040 7050






£? AAA °^ 9ACTTT TGTGT TAAAAAATCACAATGATAAGTCAGAAACAAGT 2603 ail seq 
GCAAACCGACTTTTGTGTTAAAAAATGACAATGATAAGTCACAAACAAGT 18rs21 all sea 
GCAAAGCCACTTTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGTcohlaTl sea 
GCAAAGCGACTTTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT cjbHl aiiseq 
GCAAACCGACTTTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT nem316~all seq 
GCAAAGCGACTTTTCTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT a909_a7l . seq 

CACGAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAAGATAAAACC Majority

7060 7070 7080 7090 7100

C A C C A A A C G CT AG AC G G T T C T G G A C A A G C A A CCT TT G.A A A AC AT A A A A C C 2603 ail seq 
C AC GAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACC ISrsIl ail seq 
rA^r^AA^^^J^^^^* 6 G T T ^ TG GAGAAGCAACCTTTGAAAACATAAAACC cohl_.a7l.seq 
S^^M^^^^CCCTTCTGGAGAAGCAACCTTTGAAAACATAAAACC CJbHl ailTseq 

CACGAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACC a909_.aU.seq 

TGCAGACTACACATTAAGAGAAGAAACAGCACCAATTGCTTATAAAAAAA Majority
7110 7120 7130 7140 7150

TCGACACTACACA TTA A GAGAAGAAACAGCACCAATTGGTTATAAAAAAA 2603 aii.seq 

J^^^^^^^*^*^^^ TAA0AGAAGAAACAGCACCAATTGGTTATAAAAA A^ 18rsil_ail!seq 

^5 A ^ A ^ TAGACATTAAGAGAAGAAACAGCACCAATTGCTTA cohl_.aiI.seq 
l G .^ A ^ ACTACACATrAKGAGAAG ^^^CA^CACCAATTGGTT/LTAAAAAAA cjblll ailTseq 
T G G A G A C T A- CACATTAAGACAAGAA AC AGCACCAATTG G.T TATAAAAAAA nemSie^ail.seq 

TGGAGACTACACATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAA a909__a7l .seq 

CTGATAAAACCTGGAAACTTAAAGTTCCAGATAACGGAGCAACAATAATC Majority
7160 7170 7180 7190 7200

CTGATAAAACCTGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATC 2603_ai1.seq
CTGATAAAACCTGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATC 18rs21_ai1.seq
GTGATAAAACCTGGAAAGTTAAACTTGCAGATAACGGAGCAACAATAATC coh1_ai1.seq
CTGATAAAACCTGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATC cjb111_ai1.seq
CTGATAAAACCTGGAAAGTTAAAGTTCCAGATAACGGAGCAACAATAATC nem316_ai1.seq
CTGATAAAACCTGGAAACTTAAAGTTCCACATAACGGAGCAACAATAATC a909_ai1.seq

GAGGCTATGGATCCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGC Majority
7210 7220 7230 7240 7250

GAGGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATCC 2603_ai1.seq
CAGGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTCAATGC 18rs21_ai1.seq
GAGGGTATGGATGCAGATAAACCAGAGAAACGAAAAGAAGTTTTGAATGC coh1_ai1.seq
GAGGGTATGGATGCACATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGC cjb111_ai1.seq
GAGGGTATGGATCCAGATAAAGCAGAGAAACGAAAAGAAGTTTTCAATGC nem316_ai1.seq
GAGGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGC a909_ai1.seq

C C A A T A T C C A A A A t C A G C T A T T T A T G A G C A T A C A A A A G A A A A T T A C C C A T Majority 

7260 7270 7280 • - 7290 7300 

2fJ'- £ ^ i A £ A ' T * C A A A ^. CA ^TATTTATC.AC CAT AC A A A A G A A A ATT AG C G A T 2603 alUsefr 
SIX I I ? ? A i A \ IS A G G T A T T I A T G A G - C A A A A A G A A A A T T A C C- C t 'A T. IBr^aiHL, 

^49 r £ a t I tl: ? ? A a A A I G '.t G £ I^ II I A I G A 5 G A I A ? A A A A § A A A A 1 1 A G C G At. eihl_.ali.^e4 

"250 

TAGTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAAT Majority

7310 7320 7330 7340 7350

TAGTTAATGTAGAGGGTTCCAAAGTTCGTGAACAATACAAAGCATTGAAT 2603_ai1.seq
TAGTTAATGTAGAGGGTTCCAAAGTTCGTGAACAATACAAAGCATTGAAT 18rs21_ai1.seq
TAGTTAATGTAGAGGCTTCCAAAGTTGGTGAACAATACAAAGCATTGAAT coh1_ai1.seq
TAGTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTCAAT cjb111_ai1.seq
TAGTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAAT nem316_ai1.seq
TAGTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGGATTGAAT a909_ai1.seq



CCAATAAATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATC Majority

7360 7370 7380 7390 7400

CCAATAAATCGAAAAGATCGTCGAAGAGACATTGCTGAAGGTTGGTTATC 2603_ai1.seq
CCAATAAATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATC 18rs21_ai1.seq
CCAATAAATGCAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTCGTTATC coh1_ai1.seq
CCAATAAATGCAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATC cjb111_ai1.seq
CCAATAAATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATC nem316_ai1.seq
CCAATAAATGGAAAAGATCGTCGAAGACACATTGCTGAAGGTTGGTTATC a909_ai1.seq

AAAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTG Majority

AAAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTG 2603_ai1.seq
AAAAAAAATTACAGCGGTCAATGATCTCGATAAGAATAAATATAAAATTG 18rs21_ai1.seq
AAAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTG coh1_ai1.seq
AAAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTC cjb111_ai1.seq
AAAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTG nem316_ai1.seq
AAAAAAAATTACAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTG a909_ai1.seq



AATTAAGTG 



T T 
n — 



G A G G G T 



A 
A 
A 
A 
A 
A 



7460 
1 



AAA 
1 



A C C A C T 



GTTGA 
r — 



A A C G A A 



7470 
1 



7480 
1 



A C A A 
1 

7490 
1 



CTTAAT 



C A A Majority 
7500 



ATTAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAAC TTAAT 
!IIMr GTTGAGGGTAAAACCACTGTTGAA ACCAAAGAACTTAAT 
ATTAACTGTTCAGGCTAAAACCACTGTTCAAACGAAACAA CTTAAT 
ATTAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTTAAT 
A T T A A C T G T T G A G G G T A A A A C C A C T G T T G A A A C G A A A C A A C T T A A T 
AT.TAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTTAAT 



C A A 

C A A 

C A A 

C A A 

G A A 

C A A 



2603^_al l.seq 
18rs2I_all;seq 
cohl_ai l.seq 
cjbin_ai l.seq 
riem316_ai l.seq 
a909_ail .seq 




C 
C 
C 
C 
C 
C 



c 
c 
c 
c 
c 
c 



A 
A 
A 
A 
A 
A 



C 
C 

c 
c 
c 
c 



T A 

T A 

T A 

T A 

T A 

T A 



G 
G 
G 
G 
G 
G 



A 
A 
A 
A 
A 
A 



T 
T 
T 
T 
T 
T 



G T 
G T 
G T 
G T 
G T 
G T 



C 
C 
C 
C 

c 
c 



G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 



T 
T 
T 
T 
T 
T 



G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 



G 
G 
G 
G 
G 
G 



C 
C 
C 
C 
C 
C 



T 
T 
T 
T 
T 
T 



A 
A 
A 
A 
A 
A 



T T 
T T 
T T 
T T 
T T 
TT 



A 
A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
G 



A 
A 

A 



ATAATTCAA 
A T A A f T C.A.A 
ATAATTCAA 
A T A A T T C A A A 
ATAA.TTCAAA 
ATAATTCAAA 



T 
T 
T 

•T 

T 
T 



A 
A 
A 
A 
A 
A 



G T A T 
G T A T 
G T A T 
G T A T 
GTAT 
GT AT 



GAATAATGA 2603_ail.seq 

GAATAATGA 18rs21_jail:seq 

GAATAATGA cohl_ail.seq 

G A A T A A T G A cjbUl_aiI.seq 

GAATAATGA neo316_al l.seq 

GAATAATGA a909_ai l.seq 



AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAACCAGTTGAAA Majority

7560 7570 7580 7590 7600

AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAA 2603_ai1.seq
AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAA 18rs21_ai1.seq
AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAA coh1_ai1.seq
AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAA cjb111_ai1.seq
AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAA nem316_ai1.seq
AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAA a909_ai1.seq



A 
A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
G 



C T G A 

CTG A 

C T G A 

C T G A 

G..T G A 

CTG AT T! G A T A 



T T G A T A 
TT G AT A 
T T G ATA 
TTGATA 
T t G A T A 



A A A T T A 
A A ATT A 
A. A . A T T A 
A A A.f T A 
A A A T T'A 
A A ATT A 



C\A T 
CAT 
C AT 
G A-T 
CAT 

c k-% 



CA A A T 
C A A A T 
G A A AT 

■g>a.*AvAt 
t Aja- aVt 

C A.A AT 



A A A G. A C A A T A C A G" T A G .C T C I T G T G 2603_ail.seq 
A A A G A C A A T A G A C. T A £ C TCTtGTG l8rs21_ail .seq 
A A A G A G.A A T A G- A G. T A G C t C T T G T G coKl - ail «ed 



A CAT A T G C C T C A A C C AT T TI T C A T Q G 




644 
417 
549 
B49 
S51 
550 




A CAT A T G :C C T C A A C C A T t T T T G A T G G T A C T G A A G C G A C C G TAT C A A A 1, u 
A CA T A T G C C T C A A C C A T T T T T G A T G; G T A CTG A A G C G. A C C G T A T C A A A G G 



G G 



GCGACCGTATCAAAGGG 



18rs21_ai l.seq 
cdhl_jaillseq 
cjbHl_ai l.seq 



A C A T AT G CC TC A-AC C ATTTTt.C ATCCT4CIC A A _ - „ . L;fUliI UI , SGa 

^rATi^T-^^n^oA 1 ^^ < '^^ TT ^ , ^^A^^^ T -ACTGAAGCGACCGTATCAAAG GG ne^ 6 "a "s^ 
ACATATCCCTCAACCATTT TT CATC G.T A C T G A. A CC G ACCGTATC A A AG GG SaTi l^T 
WO 2006/078318 PCT/US2005/027239
A C T T G C <K 3 LTlA'Ay^ Ufe gWgc gcj_f c4 ATCATAf, TGTATCATGCGATT Majority

7710 7720 7730 7740 Wsn 



744 
r 517 
749 
749 
751 
750 



jl_ _ , _ t 7740 _ 7750 

hStl A I ^£ C G A T C A A A A T G C T A A A G C G C T C A A T G A T A G T G T A T C A T G G C A T T 2603 ail seq 

™ AGTTGCCGATCAAAA TGGTAAACCGCTCAATGATAGTGTATCATGCGATT ISrsZl ailseq 

699 AGTTGCCGATCAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATT cohl aU seq^ 

699 AGTTGCCGATCAAAATGGT A A AGCGCTGAATGATAGTGTATCATGGGATT cjbMi ail seq 
T701 ACTTGCCGATCAAAATGGTAAACCGCTGAATGATAGTGTATCATGGGATT «em316~ail s2 

700 A G T T G C C G A T C A A A A T G G T A A A G C G C T G A A T G A T A C T C T A T C A T G G G A T T a909_ail 

ATCATAAAACTACTTTTACACCAACTACACATAATTACAGTTATTTAAAT Majority

7760 7770 7780 7790 7800

ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT 2603_ai1.seq
ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT 18rs21_ai1.seq
ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT coh1_ai1.seq
ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT cjb111_ai1.seq
ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT nem316_ai1.seq
ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT a909_ai1.seq

TTAACAAATGATCCTAACGAAGTTAATATTCTAAAGTCAACAATTCCAAA Majority 
. * . 7810 7820 7830 . 784Q 7850 ~ 

794. TTAACAAATGATGCTAACGAAGTTAATAfTCTAAAGtCAAGAATTCCAAA 2603 all seq 

567 TTAACAAATCA.TGCTAACCAACTTAATATTCTAAAGTCAAGAATTCCAAA ISrsil aii!seq 

T'TAACAAATGATGCTAACGAAGTTAATATTCTAAAGTCAAG A ATT CC AAA cohl ail seq 

TTAACAAATGATGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAA cjblli ailseq 

TTAACAAATGATGCTA AC GAAGTTAATATTCTA A AG T C A A G A A T TC C A A A ne*3 I6~all sea 

TTAACAAATGATGCTAACGAAGTTAATATTCTAAAGtCAAGAATTCCAAA a909_all . seq 

GCAAGCCGAGC A. TATAAATGGGGATCGCACGCTCTATCAATTTGCTGCGA Majority 
7860 7870 7880 7890 7900 

B44 CCAACCCCAGCATATAAATGGCGATCGCACCCTCTATCAATTTGCTGCGA 2603 ail seq 
617 CGAAGCGGAGCATATAAA'TGGGG AT CGCA CGCTCfATCAATTTGGTGCGA ISrsil ailseq 
849 CO. A AGCCCAGCATATAAATCCGGATCGCACGCTCTATCAATTTCCtGCCA cohi all seq 
849 CGAACCGGAGGATATAAATGGGG.ATCGCACGCTCTATCAATTTGGTGCGA clblH ailleq 
»«« G G A*A GCGGAGCATATA A' A TGGGGATCGCACGCTCTATCAATTTGGTGCGA ne^lG'ailseq 
GGAAGCCGAGCA TATAAATGGGG A TCGC A. CGCTCTATCAATTTGGTGCGA a909_.ail.seq 

CATTTACTCAAAAAGGTCT AAT-CAAACCAAATGAAATTTTAGACACACAA. Majority 

7910 7920 7930 7940 7950 
" - 1 ■ • L 1 1 t 



799 
799 
801 
800 



851 
850 



894 
667 
899 
899 
901 
900 



CATTTACTCAAAAAGCTCTAATGAAAGCAAATGA. AATTTTAGAGACACAA 2603 ail seq 
CATTTAC.TCAAAAAGCTCTAATGAAACCAAATGAAATTTTAGAGACACAA 18rs21 ailseq 
CATTTACTCAAAAAGCTCTAATGAAAGCAAATGAAATTTTACAGACACAA cohl ail seq 
C A T T T ACTCAA A AAGCTCTAATGAAAGCAAATGA AATTTTAGAGACACAA cjblli ail seq 
C A T T T A C T C A'A A A A.G C T C T A A T G A A A G C A A A T G A A A T T T T A G A G A C A C A A- nea316~ail.seq 
CATTTACTCAAAAAGCTCTAATGAAAGCAAATGAAATTTTAGACACACAAa909 all.seq 



A G T T C T A A T G C T A G A A A A A A A C T T A T T T T T C A C G T A A C T C A T G G T G.T C CX Majority 

7960 797°' 7?. 80 - -, ." 7990. ' ' 8000 

944 AC t TC T A AT GC T A :G A A A A A,A A C t T A TTT.T T C AC G T A A C UATGC T GT CC G 2603 ail seq - 
tl7 A G T T C T A A.T G C T A G A A A AAA A€ T TAT TT T T C A € G T A A C T/G'A T G G T- G: T,G .C C 18rs21 aljkseq 

949 ACT TC'.T A A T.G G T.-A/G A A. A A A 0 C T T if ^ t T 1! G A' C G A CT Gl AMe^ C tfec C « ' "/ • 
349 A G.T;T C T A A X G.C T; A G A A > A AA^G f 'T,A^.TvTvT T^AVCW^A^^ J C 

951 . A* T'T^C TAATGC F^^A-** A ^ : V 

950 A G T T C T A A T G'-C -T A C A A A A^AWtf',^ & rjfi i » w 

T. A. G G A T G T C TH A- T C'C C A T A A A T T t- j A A t C C T T AT A ttte^^ C !T^A C C llilorinv : . . ' 

8( . 10 . 8020 \ . SOW 8040 8050 ' 

T A C G A T G T C T T A T G C C ATA A A T T T T A A T C G T T A T A T A T C-A A C A T C T T A C C 2603 ail seq 

T A C G A T G T G T T A T G C C A T A,A A T T t T A A T C C T T ATA T A T C A A C A T- C T T A C C 18rs21 ailseq 

TAC^ATCTCTT A T.G C C A T A A A T T T T A A T C C T T A T A T A T C A A C A T C T T A C C cohl_aTl.seq 

T AC G A T.G TC T T-ATCCCAT AAATTTT AATCCTTATATATCAA-C ATCTTACC cjblli ail!seq 

D01 TACG ATGTC.TTATGCCATA A A T T T T A A TC C T T ATATATCAAC ATCTTACC nem316 _ aIUs2 

30O TACG A TGTCTTATGCCATAAATTTTAATCCTTATATATCAAC ATCTTACC a909 ailseq ' 



994 
767 
999 
999 
Alignment Report of AI-1_alignment, using J. Hein method with Weighted residue weight table.

A A A A C C jg gp A aSSI W'l gffi F rfffj ATACCAGATACA ACTCCTATT Ma.or.r. 

8 06° 8070 8080 8O90 8100 

J — 1 L 



194 
967 
199 
199 



EE »^M^t CTTTAATTCTTTTTTAAATAAAATACCAG ATAGAACTCGTATT 2603 ail seq 
2il AAAA ^ACTTTAATTCTTTTTTAAATAAAATACCACATAGAAGTCCTATT ISrsIl aiHeq 
£^ AAAA CCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATT cohlall sea 

AAAACCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATT cTbm "iHea 
^A^CCACTTTAATTCTTTTTTAAATAAAATACCAGATACAAGTGG^AiT ^ " 
!050 AAAACCACTTTAATTCTTTTTTAAATAAAATACCACATAGAAGTGGTATT agoajl.'seq 

CTCCAAGAGGATTTTATAAT CAATCGTCATGATTATCAAATAGTAAAAGG Majority 
81 . 10 8»20 8130 8140 81SO 

™* CCAACAGGATTTTATAATCAATGGT CATC ATTATCAAATAGTAAAAGG 2603 all.seq 

W57 "CT-CCAAGAGGATTTTATAATCAATG. GTCATGATTATCAAATAGTAAAAGG 18rs Il alHeq 
^15 A GGATTTTATAATCAATGGTGATGATTATCAAATACtAAAAGC cohtin "eT 
^™CAACA G GATTTTATAATCAATGGTCATCATTATCAAATACTAAAAGG cjblll all^eq 
101 CTCCAAGAGCATTTTATAATCAATGCTCATCATTATCAAATAGTAAAAGC „e«316~aU S 
ItOO CTCCAACACGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGC a^.auls^ 

ACATCCACAGAGTTtTAAACTCTTTTCCGA-TACAAAAGTtCCTGTTACTC Majority 



344 
117 
349 
349 



SI 60 81TO 8180 8190 



8200 



5144 AGATGCAGAGAGTTTTAAACTGTTTTCGCATAGAAAACTTCCTGTTACTC 2603 ail seq 

^17 AGATGGAGA G.A G T T T'T AAACTGTTTTCGCATAGAAAAGTTCCTGTTACTG I8rs21 ailseq 

1149. AGATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAACTTCCTGTTACTGcohlaTl se^ 

1149 AGATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTAC-TG cfblll ailsea 

liSl AGATCGAGAGAGTTTTAAACTGTTTTCGCATAGAAAAGTTCCTCTTACTG nem316~ail "seq 

!150 AGATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTG a909_a7l.seq 

GAG*GAACGACACAAGGA GCTTA. TCGAGTACCGCAAAATCAACTCTCTGTA Majority 
■ 82*0 . 8220 8230 8240 8250 

G A G : G AACGA C A ,C A A G C AC C T T A T C G A C'T A CCGCAAAATCAAC-TCTCTGTA 2603_all seq 
G AGG AACGA G AC.A AG C A.GCTTATCG AC T- A GCGCAAAATCAACTCTCTGTA 18rs21_ail^eq 
GAG-GAACGAC. ACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTA cohl ail seq 
GAGGAACGACACAAGCAGCTTATCGAGTACCGCAAAATGAACT C.T C'T G T A ciblll ailseq 
5^ GAGGAACGA-CACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTA nem316~ail!seq 
200 GACGAACGACACAAGCAGCTTATCGAGTACCGCAAAATCAACT.CTCTGTA a909 all seq 



ATGAGTAATGAGGGATA TGCAATTAATAG T G CA. TATATTTATCTCTATTG Majority 

^270 * 82S0 82*30 8300 

244 ATGAGTAATGAGGGATA T.G CAATTAA. TAG TGGATATAT TTAT.CTCTA T T G 2603 ail seq 

017 ATG AGTAATG A GGGATATG CAATTAA T -A GTGGATATATTTATCTCTATTG 18rs2~l ail seq 

249 ATGAGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTG cohl all seq 

249 ATG A GTAATGAG GGATATGCAATTAATAGTGGATATATTTAT'CTCTATTG cjblll ailseq 
251 ATGAGTAATCACGCATATGCAATTAATAGTGGATATATTTATCTCTATTG ne*316~all seq 

250 ATG AGTAATGAG G G A T A T C C A A T'T A A T A G T G G A T A T A T T T A T C T C T A T T G a909>il.seq 

G A C A C A T T A C A A C T G G G T C T A T C C A T T T G A T C C T A A G A C,A:A A G A A A G T T T Majority 

• • - • \. - ■ r ' i. . . 1 "j "— 1 — t J J • 



8310 8320 . 8330 8340 



^ 4 G A G A G A T T A C A A .G'-t-G' G G T;CT A T C C A T* T T G. fc$ C.C T A A G A C A A A G A A A G T T T 2603 ail Seq 

*oo Si S f S f H ' C A * C T T C ^^^A^Tt***^^-^^**?-* A ADg ;A A A : G ;TT T ifez^U^eq 
299 GAGA G.A T T A C A 1 A C T ;C G, T ^ T , Jfc* K jR^fm t;t ii *;ir r» 'n a h ~* *> ^)k-,v^x>*, S'.-a ::. - - 7; * 

299 G A. C A G A T T A G . 
301 G A 4p A G A T TAG A 

300 G A G A G A T T A.C, 

. £ T: G.C A A C-G A. A A^C'A^.T^C^viX^'^ C- T;t:"-T '.|laior£tv :• 

1 ••■■■»■ ■ . " ^. .; | - . -4^-'^ •• • - - . ■ , ' ". '' |,", ' :"* ' 11 . 1 : ■ ■ i • ■ - r* J 

8360 8370 ; . 8380 ; ; 839Q . 8400 

C T/G C A A C G A A A C A A A T C A A A A C T C A T C. G T G A G.C C A- A C .A A •€ AT T A T A C T T t 2603_ail seq 

C.T G CAACC A A AC A A AT C A A A AC T.C A.tGGTG A G C C A A C A AC ATT A T A C T T T 18rs21 allUseq 

OT,G C A A C G A A AC A -A ATCAAA A C T C A T G G T G A G C CA AC A A C A T T A TA C T T Tcohllall ieq 

CTGCAACGAAACAAATCAAAACTCATGGTCAGCCA AC. A ACATTATACTTT cjblll ail^se^i 

!fi ^ I S C A A ° ° A A A P A AA T ° A A A ACTC A ? C G Tic AG C CA AG A A C A T T A T A C T t T neaiSie aiKseq 
3SO C T'C C A A C C A A A C A A A T C A A* A. A C TC ATG G T GAG G CA A t A A CAT T A T AC T'T T a909 all 



8350 
A A T G G A jfr , A A^ te fcMtt A' Aii£; C^T Eg PK CATTTTTACTGTTGCCAT TGC Majority



8394 
8167 
8399 
8399 



. 8420 . . 843Q, gy p 8450 

AATGGAAATATAAGACCTAAAGCTTATGACATTTTTACTGTTGGGATTGG 2603 ail sea 
!^^M ATATAAGACCTAAACCTTATGACATTTTTACTGTTGC GATTGC I8rs21 ail sea 
AA IJr* AAA I A I AA * ACCTAAAG ^ alhseq 
AATGGAAATATAACACCTAAAGCTTATGACATTTTTACTGTTGGGATTGG cfblll ail sea 
8401 AATGGAAATATAACACCTAAACGTTATGACATTTT.TACTCTTGCC • ne«3 6~~a 12 

8400 AATGGAAATATAAGACCTAAAGGTTATGACATTTTTACTCTTGGGATTGG _9^1!u ^ 

TGTAAACGGAGATCCTGCT GCAACtCCTCTTCAAGCTGAGAAATTTATGC Majority 
. f* 6 ^ 8470 8480 8490 8500 

I r t AAAG ^ AGA I GGTGGTGCAACTCCT CTTGAAGCTGAG 2603_.ail.seq 
TCTAAACGGAGATCCTGCTG€AACTCCTCTTGAAGCTGAGAAATTTATGC 18rs21 ail sea 
3^ AAAGGGACATCCTGCTGCAACTCCTCTTGAAGCTGAGA A sea 

^r^ AAA ^^ A ^ A ^ GTGGTGCAACTCCTCTTGAAGCT <tfM« ali^eq 

151**^55^*1?? ne_316_alKseq 
TGTAAACGGAGATCCTCCTCCAACTCCTCTTCAAGCTGAGAAATTTATCC a909_ail.seq 

A A T C AkA TATCAAGT A, A A A C A G A A A ATT A T A CT AATGTTCATC ATACAAAT Majority 

S5 m W 8520 8S30 8540 8550 

AA I^ AATATCAAGTAAAACAGAAAATTAT ACT MTGTTGATGA 2603 all seq 

1499 AA'TCAATATC AA^r^AA^A^r^^^ A A A A ^ A G T A A. T G T T GA.TG ATACAAAT ISrsSi.all^eq 

ISi AA I GAA ^ A ^ GAAGTAAAACAGAAAA TTATACT-A AT GTTGATG ATACAAAT cjblTl ail^eq 
' AA TAAAACAGA A A AT TATACTAATGTTGATG ATACAAAT ne«316~all.s5 

AATCAATATCAAGTAAAACAGAAA-ATTATACTAATGTTGATGATACAAAT a?09._all .seq 

AAAATTTATGATGAGCTA AATAAATACTTTAAAACA AT TCTT CACC A AAA. Majority 



B444 
B217 
B449 
8449 
B45I 
.450 



8494 
8267 



5501 
8S00 



3544 



8560 8570 8580 8590 . 8600 

- 1 — ' ~~- = «- 1 , L 



AAAA "I A I G VI GAGCTAAATAAATACT T-T A-A A A C A A T T G T T C A C G A A A A 2603 ail.seq 

«_I AAAA ^n A ^ ATGACCTAAATAAATACTTTAAAACAATTGTT 18rsil ali^eq 

MaaII I ATG ATGACCTA AATA A At ACTTTA A A AC A A T T G T T G A G G A A A A cohl ail.seq^ 

^ * AAA ^n AT S A T G AGCTAAATAAA TACT.TT.AAAACAATTGTTGAGGAAAA cjblli alHeq 

A A A - A I I I A I G A I G A G G ^ A A A ^ A A A T A C T T T A AA A C A A T T G T T G ^ ne_a316"ail.seq 

AAA ATT -T ATG AT GA.G CT AAA T A A ATA CTTTAAAACAATTGTT'GACGAAAA a909 ail.seq 



1551 
5550 



ACATTCTA T T C TTGATGG A A A T G T C ACT C ATCCT ATC C C ACACATC ATTC Majority 



1594 
1367 
1599 
1599 
I6Q1 
{600 



86 lO 8620 .8630 8640 



_____ _ 8650 

AGA " GTA TT G TTGATGG A A AT CT G ACT GATCCTATGGGAGA GATGA TTG 2603 ail.seq 

*n A I I £ I A TT ^ TGA TGGAAATGTGAeTGATCCTATCGGAGAGATGATTC 18rs21 ait seq 

A r A ^^ A J T ^ TTGATGGAAATGTGACTGATCCTATGGGAGAG seq 
f o * II £ l' K T TGTTGATGGAAATGTGAG TGATCCTATCGGAGACATGATTG cjblli aii^eq 

A r A JInJ A 3-^^*^" rGATGGAAA ^ G *^ G A G ^ G ATC-CTATGGGAGAGATGA TT G nem316~ail .sq 
ACATTCTATTGTTGATGGAAATG T-G A..CTGAT.C. CTATGGGAGA- G-ATGATTG a909_a7l. seq 

A A T T C C A A f T A A A A A AT G C T C A A A GT.IT X A CAC.ATGAT G A TT ACCTTTTG Majority 
_ 'fffo- , 8690 8700 

5£ f til?. G A A T ^ ^^A^i^ffe-Tt A A A G T;T T >?T AC AC ATG A T G A TT AC GTTTTG 2603 ail.seq 

A A IIS? T T A- A/AJAVA/A-T^C. f .X C^rA^.p..!. -T: t A ti A -T GATGA T^t Acre t T TT G i8rs21 all^eq 

«49 A A T T C P A A T^M^^t^.t C ^^m^'f^^^A-T^Ai T G M^KG TP T-^^^'^I^seq^ 



S649 
S51 
}650 



„C,T X;C ;G A .AVfc&fr^ A-^A^A^ ^G^.G-T,G,g C _T_^j£_^ T G G A C €. :lia : jbritv 

■ , 8T1Q ; , , : 8720 i • : • 873o:, . 8 740 , .-* 8750 

"5H.5'$ A A :f5' G ^l' : P* ' A ^: G ^^a:A ; T s T /..A A A A > A *:m G T G T G g'ctC^G tCC AGO 2603 ail.seq 

G 1 1 G ? * A G A ISf S A €t G * T T A A A AAATG GTGTG.G C T C T T G G T G C A C C ISrsiLallleq 

T% 31 G G A A AT C A T & ° € A C T G A A ? A A * A A ATG C TGTGGCTCTTGGTG C A C C cohl ail sea 

? II S G A A A T G A T G G G A G T G A A t T A A A A A A T- G G T G T G G C T C T T6G T G G A G C cjblli ail sfeq 

bt't r r i i a It'll ttZ A I T C A A T T A AA A A ATGGTGTGGCTCTrGGTGGACC ne_316Za 
.700 CTTGGA A A TG A T G G. C A G T C A A T T A A A A A ATGG TG T GGCT C T f G G T G C A C C a909 ail.seT^ 



— n n i ,. 1 " Hy" -■■ it n ,t»»i .{■»). »■»■■ ,■ t».. tt ...ii t > ,-,...{, 

A A A C A G fcm£ TilCG <SU ZwJt T%<-f j^A -if A it; A rl? 



8760 
— I 



8770 
« 



G T T A C A G T G A C T T ATGATAAG A 

— j- 



8780 
l 



— i 

8790 
I 



Majority 



8800 



AAACAGTGATCGGrrAATT^T 

AA^cXG^G^rrrrrMn^ AAAAGATGTT ^ 

AAACAGTGATGGCrrAATTT 

AAACACTGATGGCGGAATTTTAAAAG ATC.TT AC A CT C AC IT A T C A T A A C A 



AAACAGTGATGGCGGAATTTTAAAAGATG T.T A C A G T 



GACTTATGATAAGA 



CATCTCAAACCATCAA.A 

8810 




2603_ail.seq 
18rs21_aii.seq 
cohl_ai i .seq 
cjblll_ail.seq 
nem3 16_al 1 . seq 
a909_ail-seq 



Majority 



CTACTTCTTACCTA T G A 
— — i 

8860 



GGACAAAAA 
GGACAAAAA 
GGACAAAAA 

- - -GGACAAAAA v,juiu_uu.se(i 

CTTAGGAAGTGGACAAAAA nem316_ail seq 

CTTAGGAAGTGGACAAAAA a909_all.seq 



2603^ail.seq 
18rs21_ail.seq 
cohl_aI I .seq . 
cjblll_ail.seq 



TGT 
i 



G 
G 
C 
G 
G 
G 



T 
T 
T 
T 
T 
T 



A 
A 
A 
A 
A 
A 



8870 
— i 



ACGTTTAAAAGATAA.CTATATAAGTAACA A 

» r - 



Majorlty 



8880 
t 



8890 
— I 



GTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAA 
GTTCTTACCTATGATGTACGTTTAAAAGATAACTATAT A A 
GTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAA 
GTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAA 
G.TTCTTAC CTA.TGATGT AC GTTTAAAAGATAACTATATAA 

CTTCTT ACCTATG ATCT AC GTTT AA A AG AT A ACT AT AT A A 



8900 

I— 



G T A 
G T A 
G T A 
C T A 
C T A 
G T A 



A C A A 
A C A A 
A C A A 
A'C A A 
A C A A 
A C A A 



2603_ail.seq 

18rs21_ail.seq 

cohl__all.seq 

cjblll_aiKseq 

riem316_ail.seq~ 

a909_ail.seq 



ATTTI ACAATA'CAA AT AATCCTACAACCCTAAGJCC G AAGAGTGAAAAAG Ma f oritur 

8940 




•ATTTTA 
A T T T T : A 
AT T T T A 
ATTTTA 
ATTTTA 
ATTTTA 



C A 

C A 

C A 

C A 

C A 

C A 



ATA 
ATA 
A T A 
ATA 
ATA 
ATA 



C A A A T A 
CA'AATA 
CAAA.TA 
C A A A T A 
CAA ATA 
C A A A T A 



A 
A 
A 
A 
A 
A 



T C 
T C 
T C 
T C 
T C 
T C 



G T 
G T 
G T 
G f 
G T 
G T 



A 
A 
A 
A 
A 
A 



C 
C 
C 
C 
C 
C 



A A 

A A 

A A 

A A 

A A 

A A 



C G 
C G 
C G 
C G 
C G 
C G 



C T A A G 
C T A A G 
CT A A G 
C T A" A G 
C f A A G 
C T A A G 



T 
T 
T 
T 
T 
T 



C 
C 
C 
C 
C 
C 



c 
c 
c 
c 
c 
c 



G 
G 
G 
G 
G 
G 



A A C C A A ATACTATTCGTGA TT T 

■ i ■ | 

8960 8970 



8950 

. L 

AAGAGTGAAAAAG 2603_all.seq 
AAGAGTGAAAAAG 18rs2i_ail .seq 
A A C A G T G A A A A A G cohl^ail .seq 
AAGAGTGAAAAAG cjblll_ail .seq 
AAGAGTGAAAAAG nem316_aiKseq 
AAGAGTGAAAAAG a909_ail.seq 



AACCAAATACTATTC 
A A C C A A A T\A ,CTATTC 
AACCAAATACTATTC 
AACCAAATAC T A T T C 
AACCAAATACTATTC 
A A C C AAA T A C T A TJC 



G T G A T TT 
GTGATTT 
G T C A T-T T 
GTGATTT 
GTGATTT 
GTGATTT 



C C C A A T.T CCCAAAATTCGTGATGTTCGT 2603„ail seq 
CCCAATTCC C A AAATTCGf GATGTTCCT 18rs2l ailseq 

^^t ATTCCCAAAATTCCTGATGTTCGT cohi_ail.seq 
CCCAATTCCCAAAATTCGTGATGTTCGT cjbltl ail seq 

CCCAATTCCCAAAATTCGTGATGTTCGT nem316 ail.seq 

CCCAATTCCCAAAATTCGTGATGTTCGT agog^ailiseq 



T CGGTG.ACGTTGA Majority 



GAG TT T C C GG t A. C T A 
GAG T T:t C C € G- y A C T A 

y-G.T t't'c c g g t:a g £ a 

G'A ;I G T "T. ? T C. P G-*G. J A.fcli 
& A • C T T T C-^e^C^Xv&.CrlT A: 
-VflL4£{£ T T T £ :^G-T v Av&T.^ 

•■ •••• •-.•/, 



ACCATG 
AC e AT c 
• A 'C G A T C 
Ar G'G A T C 



A C T A A T C 
A GT AA T C 
AG TV A A T C 
AG T A. AT C 

^G;-T,.A'A^;q 
AGT.A A T C 



A G A A G 
AG A A G 
AG A AG 

A G .A! A G 
: A f 'G' A A G 

AvGVAiA^G 



A A A A T G : G G T G A G G 
AAA A f s G G C .T.G A fe .'G 
A A A A f C G G T G A-'G G 
A ; AYA A T G G G; T G - A G . G 

A A : A; A T C^G :G.:£l&fe&& 
A- A A A T G G G T^G .A G^G 



TT G A 2603_ail.seq 
T T G A ISrs^I^aii.s'eq 
T T G A bohlliit i i .'seq ; 
T T G A csJb^llLiil -seq 
;T.T"'0" : A ,^3Jt6^aii ;seq] 
T"T -Q^'Sc^Ltiia^r"'. 



A T T T A T T G^T TiA , A^T A A A G II C A A A.C-A T T C A G A A t C C C T T T T G G G A C C T A M^i^ 




A Tar.T A T T A A A G T 
A T T T A T T A- A- A G t 
A T T T ATT A A A G .T 
A T.TT A T T A A A G T 
A T .T T A t T*A -A A G T 
A TT T A T T A A A G T 



T A 
T A 
T A 

t.A 
T A 
f A 



A 
A 
A 
A 
A 
A 



T AAA G A 

T A A AG A 

T A A A G A 

T A A A G A 

T A A A G A 

T A A A G A 



C 
C 
C 
C 
C 
C 



AAA 
AAA 
AAA 
AAA 
AAA 
AAA 



CAT 
C.A T 
CAT 
CAT 
CAT 
CAT 



T C AG A 
T C AG A 
•TCAG A 
T C A G A 
T C A G A 
T C AG A 



A T G G 
ATCC 
A T.C G 
A T C G 
AT C G 
A T C G 



C T T TT G 
CTTTTC 
C T TTT G 
CTTTTG 
CT.TTTG 
CTTTTG 



G G AG C TA 
G G A G C TA 
G G A G C T A 
G G A G CT A 
GGACCTA 
G G A G C T A 



2603_ail.seq 
18rs21_aii .seq 
eohl_^li.seq 
cJblll_alKseq 
.nem3 16_.aH , seq 
a909_ L ail.seq 



A G T T T C 

A G T T T C 

A G T T T C 

A G T T T C 

A C T T T C 

A G T T T C 



A 
A 
A 
A 
A 
A 



ACTTCACATAG 
ACTTCAGATAG 
A C T T C A G 
A C T T C A G 
A C T T C A G 



A C T T C A C 



A T A C 

A T A G 

A T A G 

A T A G 



A A 

A A 

A A 

A A 

A A 

A A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 

A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
G 



ATTTTTCTGG GTAT 
ATTTTTCTGG GTAT 
ATTTTTCTGG GT 
ATTTTTCTGG GT 
ATTTTTCTGGGTAT 
ATTTTTCTGGGTAT 



A T 
A T 



AAGCAATTTGTT 
AAGCAATTTGTT 
AAGCAATTTGTT 
AAGCAATTTGTT 
AAGCAATTTGTT 
AAGCAATTTGTT 



CCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAA 2603_ai1.seq
CCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAA 18rs21_ai1.seq
CCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAA coh1_ai1.seq
CCAGAGGGAAGTGATCTTACAACAAAGAATGATGGTAAAATTTATTTTAA cjb111_ai1.seq
CCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAA nem316_ai1.seq
CCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAA a909_ai1.seq



Majority 



2603_all.seq 
18rs21_ail.seq 
cohI_ail .seq 
cjblll_ail .seq 
nem316_al 1 .seq 
a909__ail.seq 



AG CACTTCAAGATGGTAACTATAAATTATATGA AATTT 



»194 

S967 
U99 
H99 
)201 
>20O 



i 

9210 
I 



ACC A CTTC A AC ATGG 
AGCACTTCAAGATGG 
A G:C A .CTTC A AGATGG 
AGCACTTGAACATCG 
AGCACT.TCAAG AT G G 
AGCACTTCAAGATGG 



9220 
■ 



T A A 

T A A 

T A A 

T A A 

T A A 

T A A 



C T 

C t 

C T 

C T 

C T 

C T 



— i — 

9230 
, i ._ 



C A A 
1 



A 
A 
A 

A 
A 

•A 



T 
T 
T 
T 
T 
T 



A 
A 
A 
A 
A 
A 



A 
A 

A 
A 
A 
A 



A T 
A T 
A T 
AT 
A T 
A T 



T A 
f A 
T A 
T A 
T A 
T A 



T A T G 

T A T G 

T A T G 

T A T G 

T A T G 

T A T G 



9240 
i 



GTCCAGATG Majority 
. 92SO 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



T 
T 
f 
T 
T 
T 



T T 

T T 

T T 

T T 

f T 

T T 



C 
C 
C 
C 
C 

c 



A 
A 
A 
A 
A 
A 



A 
A 

A 
A 
A 
A 



GTCCAGATG 2603_ail.seq 
GTCCAGATG 18rs21_all .seq 
GTCCAGATG cohI_aii.seq 
C T C C A G A T G cjblll^ail.seq 
GTCCAGATG nea3I6_ail .seq 
GTCCAGATG a909_ail .seq 



)244 
KM 7 
>249 
1249 
J251 
1Z50 



C C T A T A T A G A G G T T A A 

— P 1 . 

9260 

«• - 



AACGAAACCTG TTGTG A 
I - — . : 1 = 



CAT TTACAATT 



G C T A T A T A CACGTT 
CCTATATAGAGGTT 
G C T A T A T-A G A G G T T 
G C T A T A TAGACGTf 
GCTATATAGAGGTT 
GCTATATAGAG GT T 



9270 
« 



9280 



C A A A AT 
" ' i ■ 



Majority 



9290 
l 



AAAACGAAACCTGTTGTGACATTTAC 
A A A A C G A A ACCT. GTTGTGACATTTAC 
AAAACGAAAC C.T GTTGTGACATTTAC 
AAAACGAAACCTGTTGTGACATTTAC 
AAAACGAAACCTGTTGTGACA T T T AC 
AAAAGGAAACCTGTTGTGAGATTT AC A 



A 
"A 
A 
A 
A 



ATT 
ATT 
ATT 
ATT 
ATT 
ATT 



C 
C 
C 
C 
C 
C. 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 

A- 
A 



9300 

1^. 

A T 
A T 
A T 
A T 
A T 
A T 



2603_all.seq 
I8rs21_ail;seq 
cohl_ail .seq 
cjblll_all.seq 
nem3 1 6^a i 1 . seq 
a909_aii.seq 



294 
067 
299 
299 
301 
300 



CCAGAACTTACGAACCTGAAAGC A.'G AT CCAAAT.CCTAATAA.AA AT C A A A T Majority 

9310 9320 9330 9340 ^9350 



55^^ !II^^M CCTGAAAGCAGATCCAAAT ' GCTAAT AAAAAT.CAAa} 2603 ail.seq 
GG^C^A^T^Ar^A^^ 18rsil_ai l^eq 

A • I " A ACCTGAAAGCAGATCCAAATGCTAATAAAAATCAAAT ne^iolail .s^ 

GGAGAAGTTACGAACCTGAAAGCAGATCCAAATGGTAATAAAAATCAAAT a909_ail.seq 



344 
117. 
349 
349 

350; 




C G. G .G T A T 
CGGGTAT 
C .G .G G T A T 

C G.G G TAT 
£ G .6 G T - A T 
C G GG TA t 



C.T. T G A A G G A A A T'G G 
-C TTG A AGG 'A A A T G G 
C T f G A. A G G A A A T G G 
€ T t G A A G G AAA T G: G* 
C T T A A AG G A A A T GG 
C|T.GA A G Gf;A:A AXG G: 



T A A A C A 
T A A A C A 
T A AA.C A 

T A A A.C A 
T: A A: A C A 

T A'-A A- C A 



TCTTATT 
T C T- t A XT 
T C T.T'A T T 
T C. TvT A T T 

T- 6 W'TUkftjT A)Clfe" Ak'A t&X 



ACCAACA 
A C C A" AC A 
A G;' C- A A C A 

A jC;C: A ArC 'A 



C T 
C T 

et 

C T 

>C"£fr 



C C C A A A C 2603_ail.seq 

C C C A A A C 18rs21_a i lVseq 

C C C A A. A C ephl^ail.seq : 

G G;C A A A C cjblt lLaXliS^i 

,C;;c^ A a; a c^ h^mj&ii >seq/ 

C' C C A A A C a^i.atk's^j.- 



G C C C ACC A G G T G T T T T. T C .CT A A A A C AvG 
> • ^1 ' : : — . I . 1 r- — : 



G G .C rG IA A T 



J94 
167 
599 
399 
(01 
«X) 



G C C C ACC A G 
CC.C C A CCA C 
G C C C AC C'A G 
GCCCACCAG 
GCC C A.C C A G 
C C CC A C C A G 



9410 



9420 
. — I 



T.X?. G T. A- C A^A T -T G T C. T A T Majority 



9430 
t— - 



CTC TTTTTCC T A A A AC A G.GGG 
G JT GTTTTTCCTAAAACAGGGG 
GT G TTTTTCCt A A A AC AGGGG 
GTGTTTTTCCTAAAACAGGGG 
CTGTTTTTCC T.-A AAACAGGGG 
GTGTTTTTCCTAAAACAGGGG 



9440 
i_ 



T 
9450 



GAATTGCTA C A AT 
G A ATTGGTA C A AT 
G A AT T GCT A C A A T 
GAATTGGTACAAT 
GAATTGGTACAAT 
G A^ A T T G G T A C A A f 



t G T C t A T 2603_ail.seq 
TGTCTAT 18rs21_aii;seq 
T. G TCT A T cohl_ail.seq 
T G T C T A T cJblH_all.seq 
TGT CTAT ne«316_ail.seq 
TGTCTAT a909_ail .seq 
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9444 
9217 
9449 
9449 
9451 
5450 



H94 
J267 
>499 
M99 
*50l 
)5O0 



1544 
G17 
1549 
1549 
1551 
)550 



'594 
367. 
599 
599 
601 
600 



644 
417 

549 
549 
651 
650 



394 
167 

399 
599. 

rot 
roo 



jjTTACCATTTGTTCTTTCCGTCG Majority 
9480 9490 9500 



ATATTAGTTGCTTCTA C TTTT A T C AT A CTT A CC ATTT CTTCTTTC CCTC G 
^!^!HIS GTTCTACTTTTATGATACTTACCATTT CTTCTTTCCCTCC 

ATATTAGTTGGTTCTACTTTTATGATACTTACCATTTGTTCTTTCCGTCG ciblTt *iT\L, 

ATAUACTTGCTTCTACTTTTATGATACTTACCATTTGUCUTU 
A 1 A T T A G "~ """ """ ~ "" ~ ' 



2603_ail.seq
18rs21_ail.seq

nem316_ail.seq



T44 
U7 
T49 
M9 

rsi 

ted 



TTGGTTCTACTTTTATGATACTTACCATTTGTTCTTTCCGTCG a909_ail.seq 



TAAACAATT GTAAGCTG 



T 

9510 
i 



TGCTTGAA ATTATTAA 



A T AATAGAAAAT 



T.A A 
T A A 
T.A A 
T A A 
T A A 
T A A 



A 
A 
A 
A 
A 
A 



C 

c 
c 
c 
c 
c 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



T 
T 
T 
T 
T 
T 



T 
T 
T 
T 
T 
T 



G T 
G T 
G T 
G T 
G T 
G T 



A 
A 
A 
A 
A 
A 



G 
C 
G 
G 
G 
G 



9520 
— i 



9530 



G A C T A 
— r- 



Majority



G 
G 
G 
G 
G 
G 



T G 

T G 

T G 

T G 

T G 

T G 



T 
T 
T 
T 
T 
T 



C 
C 
C 
C 
C 
C 



G 
G 
G 
G 
G 
G 



TTGAAATTA TT A 
t T G A A A.TT ATTA 
TTGAAATTATTA 
TTGAAATTATTA 
TTGAAATTATTA 
TTGAAATTATTA 



9540 



9550 
t 



A 
A 
A 
A 
A 
A 



A T 

A T 

A T 

A f 

A T 

A T 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



T A 

T A 

T A 

T A 

T A 

T A 



C 
G 
G 
G 
C 
G 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A T 

A T 

A T 

A T 

A T 

A T 



A 
A 
A 
A 
A 
A 



C T 
C T 
C T 
C T 
C T 
C T 



A 
A 
A 
A 
A 
A 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq



GJT T T G G C 



C T T T C 
1 



CCTAT T G 

: j~ 



TCAGTCAG 



9560 
■ 



A T T A 
— t 



G T T 



GTTTTGGC 
GTTTTGGC 
GTTTTGGC 
G T TT T G G C 
G T TTTCGC 
GTTTTGGC 

ATGCC A A T 



CTT 
CTT 
CTT 
CTT 
CTT 
CTT 



T 
T 
T 
T 
T 
T 



C 
C 
C 
C 
C 
C 



C 
C 
C 
C 
C 
C 



9570 
i 



ATTATCAAGCTTC 

1 ; 



c 
e 
c 
c 
c 
c 



T A 
T A 
T A 
T A 
T A 
T A 



9580 



T 
T 
T 
t 
T 
T 



T 
T 
T 
T 
T 
T 



G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 



C 
C 
C 
C 
C 
C 



A 
A 
A 
A 
A 
A 



G T 
G T 
G T 
G T 
G T 
G T 



C 
C 
C 
C 
C 
C 



A 
A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
C 



A 
A 
A 
A 
A 
A 



T 
T 
T 
T 
T 
T 



T. 
T 
T 
T 
T 
T 



A 
A 
A 
A 
A 
A 



9590 
i 



G 
G 
G 
G 
G 
G 



TT 
T T 
T T 
T T 
T T 
T T 



A 
A 
A 

A 
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10499 TAAAGGTCGTTATAATAGCGGAGCTCATCTGAGAAAACTTTTA'TACCTCA cjbUL_ail seq 
10501 T A A A G G T C C T T A T A A T A C C C G A G C T C A T C T C A G A A A A C T T T T A T A C C T C A nem316 ail'seq 

10500 TAAAGCTCCTTATAATAG'CGGAGCTCATCTGAGAAAACTTTTATACCTCA a909_ail.se< ? 

AACTCACTCTAGCTTTCATATCATCAAGCCATTACGACTTATTCCTTATC Majority
10560 10570 10580 10590 10600

10544 AAGTCAGTCTAGCTTTGATATCATCAAGCCATTAGGAGTTATTCCTTATC 2603_ail.seq
10317 AACTCAGTCTAGCTTTGATATCATCAAGCCATTAGGACTTATTCCTTATC 18rs21_ail.seq
10549 AAGTCAGTCTAGCTTTGATATCATGAAGCCATTAGGACTTATTCCTTATC coh1_ail.seq
10549 AAGTCAGTCTAGCTTTGATATCATGAAGCCATTAGGAGTTATTCCTTATC cjb111_ail.seq
10551 AAGTCACTCTACCTTTGATATCATCAACCCATTAGGAGTTATTCCTTATC nem316_ail.seq
10550 AAGTCAGTCTAGCTTTGATATGATGAAGCCATTAGGAGTTATTCCTTATC a909_ail.seq

TTTTACTGGCCCGCCATCCATATACTGATAGATCCAGATATTTACATCCA Majority 
10610 10620 10630 10640 10650 

10594 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGAGATATTTAGATCCA 2603_ail.seq
10367 TTTTACTGGCGCGCGATCCATATACTGATAGATCCAGATATTTAGATCCA 18rs21_ail.seq
10599 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGAGATATTTAGATCCA coh1_ail.seq
10599 TTTTAGTGGCGCGCGATCCATATAGTCATAGATCGAGATATTTAGATCCA cjb111_ail.seq
10601 TTTTACTGGCGCCCGATCCATATAGTGATAGATCGAGATATTTAGATCCA nem316_ail.seq
10600 TTTTAGTGGCCCGCGATCCATATAGTGATACATCGAGATATTTAGATCCA a909_ail.seq

AAAGTTCTATGATCCTCTTTTGGCGCCTTTTTTCCACCAGATAATATTAA Majority
10660 10670 10680 10690 10700

10644 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA 2603_ail.seq
10417 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA 18rs21_ail.seq
10649 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA coh1_ail.seq
10649 AAACTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA cjb111_ail.seq
10651 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA nem316_ail.seq
10650 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA a909_ail.seq

GCTACCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG Majority
10710 10720 10730 10740 10750

10694 CGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG 2603_ail.seq
10467 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG 18rs21_ail.seq
10699 CCTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG coh1_ail.seq
10699 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG cjb111_ail.seq
10701 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG nem316_ail.seq
10700 CGTAGCTTCGTCTAACAACTCCAGCACTTTATTTACAGCACCTATTAATC a909_ail.seq

C A A A C T A C A C G A C T G A G A TTC AAGCTAT. T ' G G G A CAACCATT A:. A G T C A C A A Majority 

10760 10770 10780 10790 10800

0744 C A A A C T A C A G. C A C T G A C A T T C A A G Q T A T T; G,G C A C A A C G A T T A A G T C A C A A 2603^atl.seq 
0517 C AAA C T A C A C C A £ J^k^jr^;J^K^%Ayt Tr^fc-jfr*. C -A. A€> AT t'A'A G .T C A C A A* I8'rs21^.ail.seq 
0749 C A -A ACT AC A G G &[&x:G AG AT't'-C A^A./GsC <ir'A 'ifi g4 : G a- ir; a*a Mzt a .-r v a . a .A a ^ * a 
07.49 
075T 

OTSQr .^v/a^av^s Ave SA^.C G. A : "T^T;AvA* G T .C A C A •A' a909_ail;s< 


^fr/foC;^ A^S ^ GTf GAG Majority " 

10810 10820 10830 10840 10850

0794 A T.ar C C G G A A T C G A t T T T G A CG G T T A C G G A T A A A A A G A G G A G O A A G T T C A G 2603_;atKseq 

0567 ATT C C G G A A T C G A T T T T G A C G G T T A C C G A T A A A A A G : A G C A G G A A G T T C A G 18rs2i_aiirseq 

0799. A T7T C COCA A TC G A T t T T G A C.QG T T A C 0 C AT A A A A A G A G C A G G A A G T T C A G coKl_all.seq 

0799 ATTCCGGAATCGATTTTGAC G£ J T A C G G AT A A A A A G A G C A C G A A C T T C A G cjbUl_aiKseq 
0801. AT.TCCCC A AT C G A TTT T.GACG|]t T A C G GAT A A. A A A G A G C A G G A A G T T C A G ne«316^ail.seq 

0800 ATTCCG.GAATCGATTTTGAC G G TT A CGG A.T AAA A AG A G C A G G A A GTTC AG a909_aU.seq 



.0699 
0699 
0701 
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18rs2i„ail_s< 
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GAG 
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GAG 
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2603_all.seq 
18rs21_ail.seq 
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq 
a909_ail.seq 



T TC A Tt.C CTGTCAGCAAA 

10960 



10944 
10717 
10949 
10949 
i0951 
O950 



TTCATTCCTGTCA 
T TC A TTCCTGTC A 
T T-C ATTCCTGTCA 
TTCATTCCTGTCA 
TTCATT C„C T G T C A 
T T C A T T C C T G T C A 



CACACGACAGT CGGTCGGACAAC CCTTACCTT Majority 
10970 10980 10990 11000



G C A A 

G C A A 

G C A A 

G C A A 
C'CAA 

G C A A 



A 
A 
A 
A 
A 
A 



G A 
G A 
G A 
G A 
G A 
G A 
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CCGA 
CCGA 
CCGA 
C G G A 
C G G A 
C G G A 



C 
C 

c 
c 
c 
c 



A 
A 
A 
A 
A 
A 



A C 
A C 
A C 
A C 
A C 
A C 



CCTTACCTT 
CCTTACCTT 
CCTTACCTT 
BCTTACCTT 
CCTTACCTT 
CTTAC C T T 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq



T G A CA ACCTTAAACCTGGATTT T A T G A 

■ • — ■ 



npio 



c c T 

r- 



. 11020 



TAA AG AAACGAAAGCGCCGA Majority 



0994 
0767 
0999 
0999 

iooi 

1O00 



T G A C A A C C T T A A A C C 
.T.G A C A A C CT T A A A C C 
T G AC AACCTTAAACC 
TCAC AACCTTAAACC 
TGACAACCTTAAACC 
TGAC AACCTTAAACC 



T G 
T G 
T G 
T G 
T G 
T G 
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G 
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G 
G 
G 
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A 
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11030 



TTTT 
T T T T 
TTTT 
T.TTT 
TTTT 
TTTT 
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C 

c 
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c 
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c 



T T A A 

T T A A 

T T A A 

T T A A 

T T A A 



11040 
1 



r 

11050 
1_ 



C.TTAA 



AGAAACGAAAGCGCCGA : 2603_ail.seq 
AGAAACGAAAGCGCCGA 18rs21_aiKseq 
AG AAACGAAAGCGCCGA cohi_all.seq 
AGAAACGAAAGCGCCGA cjbllt^ail.seq 
AGAAACGAAAGCGCCGA nem3i6lail .seq 
AGAAACGAAAGCGCCGA a909_ail.seq 



ATGCTTACGTACTTGATCCTAA CACTI* 



ATGTTGTGGTCGTTCA A A A T 



1044 
3817 
1049 
1049 
I051 
IO50 




A 
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A 
A 
A 
A 



T G C 
T G C 
T G C 
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T G C 
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T 



T 
T 
T 
T 
T 
T 
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C G T A 
C G T A 
C G T A 
C G.T.A 
C G T A 
C G T A 



C T T G 
C T T G 
CTTG 
CTTG 
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CTTG 
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A T 
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A T 
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C T T 
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C TT 
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T G T 
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T G T 
T G T 
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G 



T 
T 
T 

X 

t 

T 



G G T 
G G T 
GG T 
G G T 
G G T 
G G T 



C G T T 
C G T T 
C G T T 
C G T T 
CGTT 
C C T T 



C 
C 
C 
C 
C 

c 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A. 



A T T C 
AT'TC 
A T T C 
A T T C 
A T T C 
A T T:C 



Majority

mod 
j_ 

2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq



A 
A 
A 
A 
A 
A 



G G A A 



A A A C G A C A A T T CT C G A TC A A G C T A A CTTCAAAGAGGC TC A T 

in io ■ 4 ' 



Majority 



1094 
>867 

fQ99. 
LO90 

uoi- 

liQOv 



G 
G 
G 
G 
G 



G A A 

G* A A 
C A A 
G A A 

; .-jA: A- 



A A A C 
AAA C 
A A AC 
A A AC 
A A AC 

i.-A-s-fe' 



G A C A A T T G 
G.A C A AT T G 
GAG A A-T T;G 
C A C A A TT C 



T G G A T G 
T G G.A T G 

T; 'G aG A-T..G, 
t C- G A t G 



11140 

A AGCTA AC T t ,C A A A G A G C C TC AT 
A AG C T A. A C T T C A A A G A G G C T.G A T 
A AG C f A AC T T C A A A G A G G C T G A T 
A A G C T A A C j f C A A A G A G G C T.G A T 

.A;i:G^T^;A;GtfCAAiA:C A G'GC.TG A-T 
JP^/A t ••T:-t l 'C A A A G A G C CT C A^T 



71 A T G G C t ^iA^T^ cfe^C A ^ G t^ G A G T G C G f A G C 



TACCG 2603_ail. seq 
T A C-C C i8rs21_ail v seq 
T A C C G cohLail^seq 
T A G C C- cjilil^all.seq 

ftyk>£ G;C-he«^i^i^seq> 

•T- A.C~tC'-C\^a90feaiE'^* • 



.144 
J917 
149 
149 
151 
150 




G T T C C XA - C A 
11190 



AC G 



A A Majority
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C T A 
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CT A 



G C 
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G C 

G C 

G C 
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G 
G 



T TC C T A C A AC C 
T T G C T A C A A C G 
TTCCT A C A AC G 
TT G C T A C.A A C G 
TT.GCT A C A AC G 
TTCCTACAACG 



A A 2603_ail.seq 

A A 18rs21^.ail.seq 

A A cohl^all.seq 

A A cjblll_ail.seq 

A A nejn316_ail.seq 

A A a909_ail;seq 
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TTTTCAAGCATTTTGTCT 2603_ail seq 
TTTTTCAACCATTTTGTCT 18rs21_ail .seq 
TTTTTCAAGCATTTTGTCT cohl_aii seq 
T T T T T CAAGCATTTTGTCT cjblH„ail .seq 
TTTTTCAAGCATTTTGTCT nem316_ail .seq 
TTTTTCAAGCATTTTGTCT a909_ail .seq 



TTATTC AAA AC ACTC AT TTTAACATAAA A A AC G T A T T A A AAAACATA 



11244 
11017 
11249 
11249 
11251 
11250 



11260 

L 



TTATTGA AAACAGTC 
TTAT.TCAAAACACTC 
TTATTC AAA A *G A G T G 
TTATTC A A A AG ACT.G 
TTATTGAAA AG AGTG 
TTATTGA A AAGAGTG 



11270 
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H280 



ATTTTAACATAAA A 
ATTTTAACATAAAA 
ATTTTAACATA AAA 
ATTTTAACATAAAA 
ATTTTAACATAAAA 
ATTTTAACATAAAA 



11290 
t 



T T G 
r 



Majority 



11300 

_I_ 



AAGGTATTAA A AAA C ATAT.TG 2603_ail seq 
A A CC TATTAAAAAACATATTG 18rs21_ail .seq 
AAGGTATTAAAAAACATATTG 
AAGGTATTAAAAAACATATTG 
AAGGTATTAAAAAACATATTG 
AAGGTATTAAAAAACATATTG 



coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq
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GTTT 
GTTT 
GTTT 



GTTTTGAA 
GTTTTGAA 
GTTTTGAA 
GTTTTGAA 
G T T T T G A A 
GTTTTGAA 
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G 



A T 
A T 
A T 
A T 
A T 
AT 



A 

A. 

A 

A 

A 

A 



G 2603_all.seq 
C 18rs21_all.seq 
G cohl_ail.seq - 
0 cjoIll_ail.seq 
G riem316_ail.seq 
G a909_all.seq 



T C A GAT.A 



A A T T T 

!— | 



CTCGCATT ACGAGA 



11344 

11117 

11349. 

11349 

[1351 

11350 



11360 
I 



11370 
» 



AC ATT.TTTACAC.TC T T 
i i 



C T 



T C A G A T A A A T T T C T G G C A T T A C G A G 
TCAGATAAA.TTTCTGGCATTACGAG 
TCA.GAT A A ATTTCTGGCATTACGAG 
T C A-G A T'A AAT.TTCTGGC ATTACG AG 
TCAGATAAATTTCTGGCATTACGAG 
TCAGATAAATTTCTGGCATTACGAG 



11380 
i 



A 
A 
A 
A 
A 
A 



ACAT. TTTTAGA 
A C A T T T T T A G A 
ACATT T.T-T A G A 
A C A T T TO T A G A 
A C.A T T T T T A G A 
A C.A T T tH t A G A 



11390 



CTTTTT Majority 
U400 



G 
G 
G 
G 
G 

G T G T 



T 
T 
T 
T 
T 



G T 
G T 
G T 
G T 
G T 



TCTCTTTTT 
T C T C T T T T T 
TCTCTTTTT 
TCTCTTTTT 
TCTCTTTTT 
TCTCTTTTT 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq



T T A G T T T 



ACGG A'GG'A AAAATATATAT 
. ■ » : "—t 



11410 



G G A A 
r 



A A A C A 



11420 



.1394 
.1167 
.1399 
1399 
1401 
1400 



TTAGTTTACGGAG 
TTAGTTTACGC AG 
TTAGTTTACGGAG 
TTAGTTTA C G GAG 
TTAGTTTACGGAG 
TTAGTTTACGGAG 



11430 
1 



G C A T TC A 
""" • 



GAAAAATATATATGGAAA 
GAAAAATATATATGGAAA 
GAAAAATATATATGGAAA 
GAAAA.ATATATATGG A. A A 
GAAAAATATATATGGAAA 
GAAAAATATATATGGAAA 



A 
A 
A 
A 
A 
A 



11440 

L 



CGAGTTCT Majority 



11450 



A 
A 
A 
A 
A 
A 



C 
C 
C 

c 

C 
C 



A 
A 
A 

A 
A 
A 



G 
G 
G 
G 
G 
G 



G 
G 
G 
G 
G 
G 



A T 
A T 
A T 
A t 
A T 
AT 



T C 
T C 
T C. 
T C 
T C 
T C 



A 
A 
A 
A 
A 
A 



CGAGTTCT 2603_ail.seq 
CGAGTTCT 18rs21_aiL.seq 
CGAGTTCT cohl_ail.seq 
CGAGTTCT ejbill_ail.seq 
CGAGTTCT nem316_ail,seq 
C.G A G T T C T a909_ail.seq 



C A T C C A T T G G G A GCCGAA T T CTGGGG AC 

i- ' r~ . - ■ 



A A G 
r— 



1444 
1217 : 
1449 

1449 
1451 

1450 



11460 
l : 



C A TC C A T T G G G A G G G G A 
C.A T C C A T T G G G A G G G G A 
C A T C Jc .A T T 0 G G A C G G G A 
; G A T CtGAT-T G G G A G G G G A 
G A T PviC A T T G G G i A G G"G G.;'A 
C A T C C A T T G G .feVG^ G G G A 



11470 
—i 



C T C A T T G A A C A C 
: " — i 



C A A A C,C A Majority 



A T TC T G 
A'TTCTG 
A Tj CTG 
ATT CTG 

A,T T' C T^G 
•A T T' C T G 



11480 



11490 



G G G \A C 
G G G A C 
G G ;G A C 
CfG AG 
G Gv G^A C 
G G -G Aie 



A A G C T C A.T T G 
A A GlG T jC A T.t G 
.A A G.G T C A T T G 
A AC C t c A T T G 
A: A G CT C A T >T G 
•A A jfi C jc A T T G 



11500. 



A A C A C 
A A C AC 
A A C A C 

A A e A C 

A A C":A"'C 
A A C A;C 



C A. A 
C A A 
C. A A 
C A A 

e A-i 

*C A A 



A C C A 2603__ail.seq 

A CCA t8rs2ljai l^seq - 

A OCA cpit l jaii .seq 

A C: C .A ' '^kjL^kj^i 

A, 0 ;Ci A a9C^laii - seq 




G C G C A. A C C-C G G T G G ■•.T A t T A CCA 



1494 
1267 
1499 
1499 
1501 
1500 



11510 
i 



T 

li520 



A C T C GAT C.G ,T A G C T T T A C,T C A 

-• : • m ■ ■ — 1 — — 



A' C^ AcA A A ^Maio^itv 



11530 



11540 



G C G C:A A C.G G G G f G G T A C T'A CCA 
GCCCAACGGGG T,p GTACTACCA 
G C G C - A. a|]g GG G T(J GTACTACCA 
G C GC AACCGGGTGGTACTACCA 
G C G C.A AC G-GGGTG GTACTACCA 
G C G C A AC G G GG TGGTACTA CC A 



ACTC GATCCT AGCTTTA 
A G T C G.A TCGTAGCTTTA 
A G T C GA T C G T A G C T T T A 
A G T C G A T CGT A GC T T T A 
A GTCGATCGT AGCTTTA 
ACT. C GATCGTAG CTTTA 



11550 



G t C A A CCA AAA 2603l.aii,seq 
G TC A A C C A AAA 18rs21_ai 1 .seq 
G T C A A C C A AAA cohl_all .seq 
GTCAACC.AAAA cjblU_all,seq 
GTCAACCAAAA nen316_ail.seq 
G T C A A C C A A A A a909_aU.seq 
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Gift 






CGG G A A C 



11544 
113 17 
11549 
11549 
11551 
11550 



C 
G 
G 
C 
G 
G 



11560 
1 



G 
G 
G 
G 
G 
G 



G 
G 
G 
G 
G 
G 



G 
G 
G 
G 
G 
G 



A 
A 
A 
A 
A 
A 



A 

A 
A 
A 
A 
A 



C 

c 
c 
c 
c 
c 



c 
c 
c 
c 
c 
c 



G 
G 
G 
G 
G 
G 



C 
C 
C 
C 
C 
C 



c 
c 
c 

c 
c 
c 



c 
c 
c 
c 
c 
c 



A 
A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
G 



A 
A 
A 
A 
A 
A 



11570 
1 



A 
A 
A 
A 
A 
A 



T G 
T G 
T G 
T G 
TG 
T G 



T 

11580 
— « „ - 



TAGAAAGTGTCC 

i 



GTAATGAT 
p- 



Majority 



A 
A 
A 
A 
A 
A 



T 
T 
T 
T 
T 
T 



C 
C 
C 
C 
C 
C 



C 

c 
c 
c 
c 
c 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
G 



G 
C 
G 
G 
C 
G 



C 
C 
C 
C 

c 
c 



ACT 
ACT 
C T 
C T 
C T 
C T 



A 
A 
A 
A 
A 
A 



C 
G 
G 
G 
G 
G 



A 

A 
A 
A 
A 
A 



11590 
1 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 



G 
G 
G 
G 
G 
G 



11600 

tL 



T 
T 
T 
T 
T 
T 



C 

C 

c 
c 
c 
c 



c 
c 
c 
c 
c 
c 



G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



T 
T 
T 
T 
T 
T 



G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq



11594 
L1367 
11599 
U599 
11601 
11600 



TCGATTTCGGGCGC TGATCA TGTCATGGGTTATGCTTATAGCAAATGTAC Majority 

11610 11620 11630 11640 11650



TCGATTTCGGGC 
TC GATTTCCGGC 
TCGATTTCGGGC 
T C G A T T T.C G G G C 
TCGATTTCGGGC 
TCGATTTCGGGC 



GGTGATGATG. TCATGGG 
GGTGATCATGTCATGGG 
GGTCATCATGTCATGGG 
GGTGATCATGTCATGGG 
GCTCATGATCTCATCGC 
GGTGATGATGTCATGCC 



TTATGCTTATAGC 
TTATGCTTATAGC 
TTATGCTTATAGC 
TTATGCTTATAGC 
TTATGCTTATAGC 
TTATGCTTATAGC 



A A 

A A 

A A 

A A 

A A 

A A 



A T G T A C 
A T G T A C 
A T C T A C 
ATGTAC 



2603_ail.seq 
18rs21_ail.seq 
cohI_ail .seq 
cjb!U_ail-seq. 
ATGTAC neai316_ail.seq 
ATGTAC a909_aiLseq 



TTGGGG 



A G T T G C 
1 



CGCACGAATT 
. — ] 



1644 
1417 
1649 
1649 
1651 
1650 



11660 
i 



AATCAGTG G G A 
i 



C T G A A A 



11670 



C T C 

T 



TTGGGGAGTTGCGGCACGAATTAATCAG 
TTGGGGAGTTGCGGCACGAATTAATCAG 
TTGGGGAGTTGCGGCACGAATTAATCAG 
TTGGGGAGTTGCGGCACGAATTAATCAG 
TTGCGGAGTTGCGGCACCA.ATTAATCAG 
TTGGGGAGTTGCGGCACGAATTAATCAG 



11680 
i .„.., 



TGG 
T G G 
TGG 
TGG 
T G G 
TGG 



G 
G 
G 
G 
G 
G 



A 
A 
A 
A 
A 
A 



C T G A 
C T G A 
C T G A 
C T G A 
C T G A 
C T G A 



11690 
1 



A A G G T T G 
— — — r - 



Majority



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



C 
C 
C 
C 
C 
C 



11700 
* i— 



T 
T 
T 
T 
T 
t 



C 
C 
C 
C 
C 
C 



A 

A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



A 
A 
A 
A 
A 
A 



G 
G 
C 
G 
G 
G 



G T T 
G'TT 
G T T 
G T T 
G T T 
G T T 



G 
G 
G 
G 
G 
G 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq



AAATGGTGAGAAGATTACCATTACCAGTTCAATCGGAAATGGTCAGGATT Majority

11710 11720 11730 11740 11750



1^ 1 i i I C r I tYf^TTlYJl AC CAGTTCAATGGGAAAUGTCACGATi 2 603 ail.seq 

1467 AAA.TGGTG AGA AG ATTACCATTACC AGTTCA ATGGG A A ATGGTC ACCATT ifirJi m L 
169? AAATGGTGAGAAGATTACCAT T*A CCAGTTCAATGGGAAATGGTCAGCATT it ^ 

1699 AAAT'GGTGAGAAGATTACCATTACCAGTTC'AATGGGAAATG ^HmT if^ 

!» A^TCG^G^ 

17O0AAATGGTGAGAAGATTACCATTACCAGTTCAATGGGAAATGGTCAGGATT 3909^71.^ 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq
a909_ail.seq




ATA GG Majority 
11850 



A T A G G 
A t AG G 
A T A G G 
A t AG G 
AT A G G 



2603_ail.seq
18rs21_ail.seq
coh1_ail.seq
cjb111_ail.seq
nem316_ail.seq



CTACGCAACTATATCTTTC GT CACATTACA tcta 



844 
617 
849 
849 



11860 
l 



CTACGGAA CT A T A T 
CTACGGAACTAT AT 
C TAG GG AACTATAT 
CTA'CGGAACTATAT 
851 "CTACGGAACTATAT 
850 CTACGGAACTATAT 



U870 
i 



CTTTCGTCACATTA 
CTTTCGTCACATTA 
C T T T C G T C A C A T*T A 
CTTTCGTCACATTA 
C T.T TCGTC ACATTA 
CTTTCGTCACATTA 



11880 
i 



CAG AT AC T A 

11890 
i 



A T A G G a909_all.seq 



CCATCA A Majority 
11900 



CATCTACAGATAGTA 
€ A A T CTACACATAGTA 
CATCTACAGATAGTA 
CATCTACAGATAGTA 
CATCTACAGATAGTA 
CATCTACAGATAGTA 



C C A T G A A 2603__all.seq 

CCATCAA 18rs21_ail.seq 

CCA T G A A cohl_.ail.seq 

C C A T G A A cfblll„ail.seq 

CCATCAA nem3l6_ail.seq 

CCATCAA a909_all.seq 
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WO 2006/078318 ' ^PCT/US2005/027239 

Alignment Report of AM__alignment using J. Hein method with Weighted residue weight table.

Thursday, July 29, 2004

TTTTGCTTATATGACCAAGTAAAGTGAGGATATACTAACAAATGAAATAT Majority

11910 11920 11930 11940 11950

TTTTGCTTATATGACCAACTAAAGTGAGgItATACTAACAAATCAAATAT 2603 ail seq 

Wttl I" TCCTTATATGACCAAG TAAAGTGAGCATATACTAACAAATGAAATAT ISrsi! alHeq 

11899 TTTTGCTTATATGACCAA-CTAAAGTGAGGATATACTAACAAATGAAATAT cohl a it ^ 
H899 TTTTGCTTATATCACCAAGTAAAGTC-ACCATATACTAACAAATCAAATAT cjblll alHea 
11901 TTTTGCTTATATGACCAAGTAAAG T G AGGATATACT AA CAAATGAAATAT nentfie'ai I seq 

11900 TTTTCCTTATATGACCAAGTAAAGTGAGGATATACTAACAAATGAAATAT a909_a7l.seq 

,T T A T T A T C G T A T T T G T C C A T T T T A T C C A A A A G T T T C C A T A T T A T C A T T A T Majority 
11960 11970 11980 11990 12000

11944 TTATTATCGTATTTGTCCATTTTATCCAAAAGTTTGCATATTATCATTAT 2603 all seq 

Will I TATTATCGTATTTCTCCATT TTATCGAAAAGTTTGCATATTATCATTAT 18rs21 ailleq 

11949 TTATTATCGTATTTGTCCATTTTATCGAAAAGTTTGCATATTATCATTAT cohl ail seq 

\™t 9 t TTATTATCGTATTTCTCCATTTTATCCAAAACTTTCCATATTATCATTAT cjblll aiHeq 

MJ> II*ZZ* i:CGTArr ' rG ? CC *-*'rTTATCGAAAAGTTTGCATATTATCATTAT ne«316~ai 1 .seq 

U950 TTATTATCGTATTTCTCCATTTTATCGAAAAGTTTGCATATTATCATTAT a909„all . seq 

GTTTGATAA. GATGCAAAT ATA ATGATACTAGGAGCTAAATATG GATATTT Majority 
12010 12020 12030 12040 12050

11994 GTTTGATAA GAT CC AAATAT A ATCAT ACT AGCAC.CT A AATATC GATATTT 2603 ail seq 

GTTTGATAAGATGCAAATATAATGATAGTAGGAGCTAAATATGGATATTT 18rs21 ailTseq 

11999 GTTTGATAAGA'TGCAAATATAATGATAGTAGGAGCTAAATATGGATATTT cohl all. seq 
1999 GTTTGATAAGATGCAAATATAATGATACTAGGAGCTAAAT.ATCGATATTT cjblll alilseq 
2001 GTTTGATAA GA.TGC AAA TATAATCATAGTACG A CCTAAATATG GATATTT ne»3l6~ail .seq 

12000 GTTTGATAACATGCAAATATAATCATACTAGCACCTAAATATGGATATT.T a909_.a7l.seq 

.AAAAAATCAAG AGTATC.CTAAGTGCTTTCC. ATTTTCAAATTCAAATATAG Majority 

12060 12070 12080 12090 12100 

12044 AAAAAATCA AOAGT ATC CTAACTGCTTTCCATTTTGAAATTCAAATATAG 2603_all,seq 

1817 A A A A A A T-C A A G A C T A T C C T A A G T G C T T T C C A T T T T G-A AATTCAAATA T-A G 18rs21_ai Lseq 

2049 AAAA-AATCAAGAGTA'TC.C T-A AGTGCTTTCC AT TTTGAAATTCAAA TA TAG oohl.ail.seq 

£049 A A A A A A TC A A C A GT ATC CT A A G T G C TTT CC A TTT TC A A A TT C A A AT AT A G cjblll all!seq 

2051 AAAAAATCAAGAGTATCCTAAGTGCTTTCCATTTTGAAATTCAAATATAG n«a3l6_ail seq 

12050 AAAAAATCAAGAGTATCCTAAGTGCTTTCCATTTTCAAATT-CAAATATAG a909_aii:seq 

CTAATAGTTCTAGAACTT CTAATTGTTTTTCCTCCAC.GATATGAATTTTC Majority 
12110 12120 12130 12140 12150

2094 C T A A T A G T T C T A G A A C T T C t A A T TC T T T T T C G T C G A C G A T A T G A A T T T T C 2603 ail. seq 

.1867 CTAATAGTTC.TAGAACTTCTAATTGTTTTTCGTCGACGATATGAATTTTC 18rs21 aiKseq 

2099 CTAATAGTTCTAGAACTTCTAATTGTTTTTCGTCGACGATATGAATTTTC cohl ail seq 
.2099 CT AATAGTTCTAGAACTTCTAATTGTTTTTCG TC GACGATATGAATTTTC cjblll ail!seq 
2101 CTAATACTTCTAGAACTTCTAATTCTTTTTCGTCGACGATATGAATTTTC nemSieZail seq 

2100 CTA ATAG.TTCTAG AACTTCTAAtTGT T.T TTCGTCGACGATATGAATT.TTC a909_all.seq 

AATCTTAACTGT TAC G A TTCCACCTCCCTTTCCT TA A AG AA A' A A A CCTC A Majority 
12160 12170 12180 12190 12200

2144. A A T. C T T A A C T G T T A G G A T T G C A C C T G C C,T. T f G G T T A A A G A A A A A A G G T C A 2603 ail. seq 

1917 AATCTTAACJGTTAGGATTCCAeCTCC.CtTTGGTTAAAG A A A A AA G G t C A I8rs21 ai|l2eq ' - 

2149 A A T C T T A A C TC T T A C G ATT C C AQCTCCC T t TC G T f A A AG A A A A A A GGTC A cohl all .seq 

2149 A AT C TT A AC T G TT A CCA T.T C C A CC X CCqtf T G GTTAAAGAA AAA A ! 'G G TC A cjbllUall!seq . 
2151 A ATC TT A AC T G T T A gB A T T CCA CC TCC C T T T G G T.T A A A G A A A A A A GG T C A nem3 1 6_a i I . seq 

2150 A A T;C T T A A C .T G T TAG G A T T C CA C C T C.C G T T. T C C T T A A A G A A A,A A A G G t C A aM^iLseT' 

GGTC G TTT AG AT A AC.TTTCTC.AAACA AGCTCA AG C TATCTAA AAATACTT Maloritv 

12210 12220 12230 12240 12250

2194 G GTCGTTTAGATAACTTTGTCAAACAAGCTCAACCTATCTAAAAATAGTT 2603 ailise^ 

1967 C GT C G T T T A G A T A A C T T T G T C A A A C A A G.C T C A A G C T A T C T A A A A A T A G T T 18rs21 ailTseq 

2199 G G TC G T T T A G A T A A C T T T G T C A A A C A A G C T C A AG CT ATCtA A AAATACTT cohI_ail ,seq * 

2199 GGTCGTTTAGAT. AACTTTGTCAAACAAGCTCAAGCTAT C.T AAAAATAGTT cjbUl ailTseq 
2201 G GT C GTTTACATAACTTTGTCAAACAACCTCAAGCTATCTAAAAATAGT T nem316 aii.seq 

2200 GGT.CGTTTAGATAACTTTGTCAAACAAGCTCAAGC TATCTAA AAA T-A G T T a909_a7l seq 
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Jfignment Report of AM JB^OJW l?j&? 1(e ihod ^ w ^ ^PCT/US2005/027239 

T G A A A T iclr'T ggg Wt Eg jgl AGCT ATCTGATGAGCAGAAG G Majority : 

12260 12270 12280 12290 12300

12244 TGAAATGGGCATTACTCTACTTTTTAATAAGCTATCTGATGAGCAGAAGG 2603 ail seq 

2017 TCAAATGCGCATTACTCTAGTTTTTAATAAGCTATCTCATGAGCACAAGG 18rs21 ail seq 

12249 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTGATGACCACAAGG cohi_a7l seq 
2249 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTCATGAGCAGAAGG cjbUl.aiiseq 
2251 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTGATGAGCACAAGG nem3I6 ailseq 

12250 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTGATGAGCAC'AA-GG a909_a7l.seq 

ACAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT Majority 
12310 12320 12330 12340 12350

12294 ACAAGTTAATCCATCTTCGCAAGTCTTATTTTGAC TAT CAAGAAAATGCT 2603 ail. seq 

12067 AGAACTTAATCCATGTTGGGAACTCTTATTTTCACTATCAACAAAATGCT 18rs21 ailseq 

12299 AGAAGTTAATGCATGTTCCCAACTCTTATTTTCACTATCAACAAAATGCT cohi all. seq 

12299 AGAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT cjblll ailTseq 
12301 AGAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT ne*316_an seq 

12300 AGAAGTTAATGCATGTTGGGAACTCTTATTTTCACTATCAAGAAAATGCT a909_ail.seq 

CTTATCCCACAA TTAGGTTTTCTATATTCTAAATTAACTAAAAAAATTGA Majority . 

12360 12370 12380 12390 12400 



2344 CTTATCCCACAATTACGTTTTCTATATTCTAAATTAACTAAAAAAATT G A 2603 aiKseq 

2117 CTTATCCCACAATTAGGTTTTCTATAT.TCTAAATTAACTAAAAAAATTGA 18rs21 alHseq 

2349 CTTATCCCACAATTACGTTTTCTATATTCTAAATTAACTAAAAAAATTCA cohl all seq 

2349 CTTATCCCACAATTAC G TTT T C T A T ATTCTAAATTAACTAAAAAAATTGA cjblil ailseq 
2351 CTTATC. C CACAATTAGGTTTTCTATAT. TCTAAATTA A.C TAAAA.AAATTGA ne*316 ail seq 

2350 CTTATCC CACA ATT A CC TTTTCTAT A T T C T A A A TT A A CT A A A AAAATTG A a909_ail.seq 

ACTTGA TAATCCCTTGtCTCCGACTGAACAAAAGTTATTGATTACCTTAT Majority 
12410 12420 12430 12440 12450

2394 ACTTGATA-ATCGGTTGTC T'.C C G A C T G A AQAA AAGTTATTC ATTACCTTAT 2603_ail seq 

2167 A C T T G A T A A T C G G T T G T C T C C G A C T G A A C A A A A G T T A T T G A T T.A C C T T A T 18rs21_ail seq 

2399 A CT T C A T A A T C GG TT GT C T C CG A C T G A A C A A A A G T T A T T G A XT A C C T T A T cohl_ail seq 

2399 ACTTGATAAT C G GTTGTCTCCGACTG A AC AA AAGTTATTC ATTACCTTAT cjblll_ail^eq 
2401 A C T T G A T„ A A T C G G T T G T. C T C C G A C T G A A C A A A A G T T A T T G A T TACCTTAT nem316_all *seq 

2400 A CT T G A TAATCGGTTGT CT C C G- A C T G A A C A. A AAGTTATTGATT AC C T TAT a909_ail,seq 

TATTACATAC T. A A ACGTTTAATCATTCAT AT CT AAGAAGTAAGTCAGCTA Majority 

12460 12470 12480 12490 12500

2444 TATTACATACTAAAGGT.TTAATCATTGATATGTAAGAACTAAGTCAGCTA 2603_ail seq 

2217 TATTACAT.ACTAAAGGTTTAATCATT.CATATCTAAGAAGTAAGTCAGCTA 18rs21_ail seq 

2449 TATTACATAC -TAAAGGTTTAATCATTGATATGTAAGAAGTAAGTC AG C T A colil„aU seq 

2449 TATTACATACTAAAGGTTTAATCATTGATATGTAAGAAGTAAGTCAGCTA cjblll_ailseq 
2451 TATTAC A TACT A AAGGTTTAATCATTGATATCT AAGAAGTAAGTCAGCTA nein316_aU.seq 

2450 TATTACATACTAAAGGTTTAATCATTGA.TATGTAAGAAGTAAGT C AG C T A a9G9_aii.seq 

A C C G A T C TT TC T ATTCTA AA ACTT ATATTGTTCCTTTAGA AATTTTA AAG Majority 
12510 12520 12530 12540 12550

2494 AC C G A tC TT TCTATTCTAA AA CTTATAT T G TTCCT.TT ACAA ATTTTA AAG 2603 ail. seq 
2267 A C C C A TCTTTCT AT TCT :A AAA C T T A T A T T G T XG C T T T A.G A A A T T T T A A AG 18rs5i aillseq 
2499 A.CCG "A, T Ct TTCTA T TC T AAA A-'g. T T A T A TT GTTGC.TTT A G.AA A T T T f A A A G cohl^aluseq 

2499 A C C G A T C T T T C T A T T C T A A A A CT T A TAT T G T T G C T T T A G A A A T T T T A A A G cjbUl_ail.seq 
2501 .A C.C.G.A TCTTTCT A TTCT A A A, A C T T A TAT T GTTGGT T T A G A A A T t TTA1AG ne*316_ail.seq 

2500 ACCGATCTTTCTATTC T A AAA C.T T A T ; A"TTC T T G C T T T A G A A A T T T T A A AG a909 aii.seq 

A G C G T G G A t G G C T T C A T. A A T A A A C A C A A A T C T t A C C A A T T T G CG.AAG CC A Majority 

12560 12570 12580 12590 12600

2544 AGCGT.GG AT GGCTTCATAATAA A;C A G A A A T CT TA.CCAATTTGCGAAG CCA 2603_ail seq 

2317 A G C G T G C A T G G C T T C A T A A T A A A C A G A A A T C T T A C C A A t T T G C G A A G C C A 18rs21_ai lTseq 

2549 AGCGTCC A T GG CJTT.C A T A A T A A A CAC A A A TCTT A CC A A TTT..C CC A AG CC A cohCail seoP 

2549 AGCGTCG'ATGG CT T.CAT AA'TAAA C 'A G A A A.T C T T A CC AATTTGCGA AG CCA cjblll.aulseq 
2551 AGC GT GGATGGCT'TCATAATAAACAGAAATCTTACCA ATTTGCGAAGCCA ne«i316_all seq 

2550 AG.CGTGGArCGCTTCATAATAAACAGAAATCTTACCAATTTGCGAAGCCA a909_aii:seq . 
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A A A A A T 



12610 12620 12630 12640 12650

AAAAATATGATATTTCAAGAGTCCAAAGATCTAATAGATACTCCACTT AC 2603 alt sea 

ll9AAAAA^?r^t^^M GAGTCCAAAGATCTAATACATAGTC ^ lorsit " it^ea 

2599 AAAAATATGATATTTCAAGAGTCCAAAGATCTAATAGATAGTCCAGTTAC cohl all sen 

^? AAAAA 3 ATGATATTTGAACACTCCAAAGATCTAAT *GAT.AGTCCAGT^AC ullfseq 

2 S ^ A t AA A^ A ^ GA ? A ^^ GAAGAGTCCAAAGATCTAATACATACTCCAGTTAG nS"a , 2? 

2600 A A A A A T A T C A T A T T T G A A C A C T C C A A A C A T C T A A T A C A T A C T C C A C T T A C a909_a7l .seq 

ACAAGCCTTCATTATAACTGATAAGCAT TTTCAAAAATT A A AACAACAGC Majority 
12660 12670 12680 12690 12700

^T AG ^ A ^^ G ^^ GATTATAAGTCATAAGGATT ^" TCAAAAATTAAAA CAAGAGC 2603 all.seq 
^I ACAAGCGTTGATTATAAGTGATAAGCATTTTCAAAAA TTAAAACAACAGC ISrsIl a Heq 
2fUQ A r a a r r !* t't r ittiI" G ^ "TAAGCiTTTTCAAAJATTAAlACAACAGC cohl a ft "eq^ 
«^A GAAGCG ^ GA n ATAAGTGATAAGGATTTTCAAAAATTAAAA < : AAGACC cjblll ail^eq 
^n AGAAGCG n GA n ATAAGTGATAAGGATTTTCAAAAATTAAAA C AA C A GC nLsio'all.seq^ 
2650 A G A A G C C T T GATTATAAGTGATAAGG AT TTTCAAAAATTAAAACAACACC a909_a7l.seq 

TATTATTTTAACCGACTTAT TTTAAAGACTTATCATATCTACCCTTGCTT Majority 
12 T 10 12720 12730 12740 I27S0 

tit ?A??A^^ AACCGAC ^ A i: TT i AAAGAC ^ TATCATATci ' AGGCTTGCT T 2603_a 1 l. Seq 

^?A^A?^^ AACCGAC ^ A ^I TAAAGACTTATCATATCTAGGCTTGCTT 18r S 21_ail?seq 
^ I A TTATTTTAACCGACTTATTTTAAAGACTTATCATATCTAGGCTTGCTT cohl all. sen 
K99 T A T T A T T T T A A C CG ACTTATTTTAAAGACTTATCATATCTACGCTTGCTT cjbm a Hea 
S701TATTATTTTA AC C GACTTATTTTA-AAGACTTATCATATCTACGCTTGCTT 2«Us!dl'!S 
W0OTATTATTTTAACCGACTTATTTTAAAGACTTATCATATCTAGCCTTGCTT M "eT 

CATCATTCCGAAAAATACG GACACTATACTATTTCAACCAAAAGATACAA Majority 
. ; lZ7 . e ° '2770 12780 12790 12 800 

m C A? G A^ CG ^ AAAAA J ACG ^ AGACfATAC ^ ATTTCAAGGAAAAGATACA 2603_all.seq 
»iI GA I C ^^^ CG ^ AAAAATACGGAGACTATACTATTTCAAGGAAAACA TACAA 18rs2I ail^eq 

%% t A T r A II r r r t * a ? A J AC ° ° A * •* C T A T '* CT A TT T C A A G G ' A A A AG A T A C A A cohl_a7t.seq^ 

^s^ c A^ CA ^T CGGA ^^ A ^^ ACGGAGAcTATAcTATT^cAAGGAAAAGATAcAA cjbm_au^eq 

^ GA ? G tJi CGGAAA M^ CGGAG AC TATACTATTTCAAGGAAAAGATACAA „e*316_ail. S e2 
aSOCA-TCATTCCCAAAAATACGGAGACTATACTATTTCAAGCAAAACATACAA a909_a7l.seq 

AACTTTCCAA.TCA A G*T CTT CAACTATACATCCTTCAAAGTCATCGGCTAG Majority 
■ I 2 .? 10 IZSZO 12830 . 12840 I28SO 

2" AAGTTTCeAATCAAGTCTTCAACTATACATCCTTCAAACTCATCCGCTAG 2603 all seq 
S 6 I AAG "I CGAATCAAGTCT ' rCAACTATACATCCTTCAAA CTCATCGGCTAG ISrsit a Heq 
^a AAG III CGAATGAAGTCTTCAACTATACATCCTTCAAA CTCATCCGCTAGcohl all.seq 
™ J J AATCiACI " ICAACTATAC4KCTTCiUC "AIC C tCTAC cjblll a Ueq 

^* AAG ni CGAATCAAGTCTTCAACTATAGA1 : CCTTCAAACTCA TCCGCTAG ne«316~all. s e2 
!800 A A G T T T C G AA T C A A G T C T T. C A A C T A T A C AT C C T T C A A A G T C A T C G G C T A G a909_a7l.seq 

AGATTTCGAATTATGAACCA A T.C C C TTTGATTACTAGAAAAATAAATACC- Majority 
■ , .. . 12860 . . 12870 12880 12890 . 12900 

A G A .T, J T .G G A A T T A T G'A A C C A A T C C C T T T G A TT ACTAGAAAA'ATAAA T.A G-C 2603 all. sea. 

^ a c at I I Zt A * 11 A T G A ACGA ATCCCT T T.C A IT A C T "A G X A A A A T.A A AT A G C 18rsil ail^eq 

A Zt i T T G G A A T T A T G A A C C A A T C C C T T T G ALIA C T.A.C A A A A A T A A A T A G C cohl all.seq 

!849 AGATTTGGA ATT A T G A A CCAATCC.CTTTGATTACTAGAAAAATAAATAGC cjblll alHea 

*£ * I J I G 6 A A T T A T G A A C G A A T : C C C T T T.C A T T A G T A .'G A A A A A T A A A T A G C ^1 o"a !s2 

»50 A G ATI T- G G A A T T AT G A AC CAAICCC ITT G A T T A C TAG A A A A A T.A A A T A C C a909_sil.seq^ 

T T G G A G A G G C T A A C G T C A C A C T G G T T G A T.C C A A T C T C G C T T T ATT TAACA Majority 
\ t2 ? 10 ^g20 12930 1294Q 12950 

Z I ^5 ^ G A ° G 10 T A A ^ C G T ' C > CACTGC.TTGATCCAATCTCGCTTTATTT A A C A 2603 all.seq 
^IJSa^ CAGGCTAACGTG A CACTGGTTCATCcaatc T 18rs21 allTseq 

^ T I G ^ A « AGGC TAACGTGACACTGGTTGATCCAATCTCGCTTTATTTAACAcohI all.seq^ 

^Ofl T T*r r*A ri"rro-rA^I^^ TGACA G T G G ^T GAT ^^^ATC'T. CGCTTT A.T T T A A C A ne^ielall.^ 
t900 TT G.GA GAGGCTAAC G-T GACACTGGTT P- A TCC.AATCT CGCTTTATTTA AC "A a909 all.seq 
C T A A G A A"T G"A TCAAC ACCCt CGtAlt^AACAACAAC T T GACCACCTACAA Majority
_ . 12 9 6Q 12970 12980 12990 13000 

!Sf 4 " AACAAT CATG,AAGACCCTCGTATTCAAGAAGAAGTTGAGCAGCTAG'AA 2603_aii seq 
12717 CTAACAATGATGAAGACCCTCCTATTGAAGAAGAAGTTGAGCACCTAGAA I8rs21 ail seq 
12949 CTAAGAATGATGAAGACCCTCGTATTGAAGAAGAAGTTGAGCAGCTAGAA cohl aTl seq 

12949 CTAAGAA.TGATGAAGACCCTCGTATTGAACAAGAAGTTGAGCACCTACAA cjblll ail seq 
12951 C TAAGAATGATGAAGACCCTCGTATTGAAGAAGAACTTGAGCAGCTAGaI ne«316~atl seq 

12950 CTAACAATGATGAACACCCTCCTATTCAACAAGAACTTCAGCACCTAGAA 3909 ail seq 



GATAA C A T 



12994 CATAAGAT 
12767 CATAAGAT 
12999 CATAAGAT 

12999 G A 

13000 ■ A 

I3OO0 CATAAGAT 



Majority 



2603_ai1.seq
18rs21_ai1.seq
coh1_ai1.seq
cjb111_ai1.seq
nem316_ai1.seq
a909_ai1.seq



Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.
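The shading rule in the legend above can be illustrated with a minimal sketch. It assumes the "Majority" row is a simple per-column majority vote over the aligned rows, which the figure does not state explicitly. The function names `majority_consensus` and `shade_differences` and the toy sequences are hypothetical and are not taken from the figure; differing residues are shown in lower case as a plain-text stand-in for solid black shading.

```python
from collections import Counter


def majority_consensus(rows):
    """Return the most common symbol in each column of an alignment.

    Assumption: ties are resolved in favour of the symbol seen first
    in the column (the documented behaviour of Counter.most_common).
    """
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*rows)
    )


def shade_differences(rows, consensus):
    """Lower-case every residue that differs from the consensus.

    Lower case stands in for the solid black shading of the figure.
    """
    return [
        "".join(c if c == m else c.lower() for c, m in zip(row, consensus))
        for row in rows
    ]


if __name__ == "__main__":
    # Hypothetical toy alignment, not data from the figure.
    toy_rows = ["ACGTA", "ACGTA", "ACCTA", "TCGTA"]
    consensus = majority_consensus(toy_rows)
    print(consensus, "Majority")
    for shaded in shade_differences(toy_rows, consensus):
        print(shaded)
```

Rows are expected to be upper case and already aligned to equal length; gap characters, if present, are treated like any other symbol.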



CGCCT.TGI'ICC GATGTTGATl 

CGCCTTGTTCCGATGTTGATCCCGATAACTCCTCGCTCATTAATArrrTr "°* 5 ' ; ;- al ' I - se< l 
CCCCTTCTICCCATCTICATCCCCATAACTCCTC CtC UU^G C« 
«CCCTTUtCCC»TCTTCATCCCCATAACTCCTCG"«"nncCCuK7 



TTCGTAACGCTCTTTAATTATCTCTAACTT AGCATGGGTATTGGTAAA AT Majority 

60        70        80        90        100



TTCGTAACGCTCTTTAATTATCTCTAACTTAG C ATGGCTATTGGTA AAAT 2603 ai2 ^ 
T^C^T^ACGCT^T^T^T^^ 

llvlr AACCC^C^^ A TAACTTAGC ATGGGTATTGGT.AAAATSlS^seq^ 

Ileal A AC CC^TTTT A A I I A I ? T C T A A C T T A G C A T G G G T A T T G G T A A A A T cjbl 1 l.alTseq 
TTCGTAACGCTCTTTAATTA T C TCTAACTTAGCATGGGTATTGCTAAAAT h36b_a7 2 .seq 

TTTGAAAATAGACTAAGTATTTATTAA C CTCACGCCACTTTCfATCCATG Majority 






T T T G ATirTTG^cT^ 2603_al2.se* 
TTTGAAAATArArT AA rT A TTT A C _W TCACGCCACTTTCTA TGCATG 18rs21__12*L, 

^?r A A AA ^ A r A ^ AA ^ ATTTATTAA C|lT CAGCC CACTTTCTATGCATG515 ai2.seq 
III CAAAATAGACTAAGTATTTA T T A A C.C TCAGGC CACTTTCTATGCATG clhlll 

TTTGAAAATAGACTAAGT-ATTTATTAACCTCAGGCCACTTTCTATGCATG ^"^^ 
A A ATC AATTTCTTT A TAG A ATT GTTCACG A ATACCAGCTTCTCG AGCA A C Majority 

160 170 180 190 ~20b 



A A ATC A AT I T c£ tI A I A r a a J ?? J S 5 ^ * ^ A G C A C C TT C T G G A G C A A i 2603_al2.se q 

A A A £ _ £ ATT ?r^T A T A r AA ^^i^ ACGAATAGGAGCTTCTGGAGCAA 18rs21_alCeq 
AAA^CAA^?c?^A^ A r^ 

* « . i« A ! TATAGAATTGTTCACGAA TAGGAGCTTCTGGAGCAAC clblll ai2 seo 

AAATCAATTTCTTTATAGAATTGTTCACGA AT AGGAGCTTCTGGAGCAAC h^ala !seT 




TAT At CAT CCC€, A A C ? A A A C T G T G C A A A A A G T G C A C G C T C C T C T A G 2603_al2.seo 

TATAGC*ATC'rrrTrAArr 18r32l^al2^eq 

^A?^GCA^CCCC?rA^r? A ^ S15_.al2.seq 

V?i-^ CTGAACCAGAAACTGTGCAAAAA GTGCACCCTCCTCTAG clblll _12 sen 

TATAGCATCCCCTGAACC AGAAACTGTGC AAAAAGTGCACCCTCCTCTAG ^_al2 seq^ 

CAACTGfTCCC TCTCTGTT ACGACAGTCAAAACCACCATCTATAGGTAAT Majority 






CAACTGTT fcCGTCTc'TrTTArr^r^ 2 603_a*2.seq 
CAA^^^rr^I^^I AGGACAGTCAAAACC ' AGC ^ ISrsai^alCeq 
CAACTGTTCCGTCTCTGTTAGGA.CAGTCAAAACCAGCATCTATAGGTAAT51S ai2 sea 

IAACTGTTCCGTCTCTGTTAGGACACTCAAAACCAGCATCTA.TAGGTAAT h36b_al2.seq 

T T A A A T,A T T T T T T C T C C A A A G A G T.T C T C C A T A A T A A T C A T T A A T C C C A C G Ifamn^ 

3,0 -. 320 . . 330 . 340 ~3S0 



•T T A A A T AT T llJll I CC A A A t A 5 1 1 SIS-? * A T A A t C A T t A A-T C G C A C G 2603^12.^ 
T T A i 11 A 1 1 1 1 1 t r I S r * « , S ^ I I S I 9. Q * T A A T * A T C A T T A A T C G C A C G IS^^aiCeq 

T T. A A-ATATTTTT T.C T:C C A A A G A G T T C.T C C A T A A T A A T C A T T A A T C G C A C G hWal^.seq; 

A T A A C G T T T T T T C A T A G C A T A A TT G T A T C A C A ATT ITAA C T A A A A T A A c. r KajorAty 

360 . 370 - 330 • 390 ' 



* ? A A CG^It^tTp A I*CGAT.AATTCTATCACAATTTTAACTAAAATAACC 2603.aA2.seq 
A T A AC GTTTTTTCATAGGATAATT G-T AICACAATITIAACIAAAATAACr ,1?™ 

a^aacg^??tTa?^ca^ 

AT A A C CTTT.TTT-CAT AGC AT.A A TTSWT.C AC AiTTTT A ACTA A A ATA AC C cibUl aI2 sea 
A T A A C G TT T T T T C A T A C.C A T A A T T G T A T C A C A A T TTT A ACTA AAATAA'CC ^"all!^ 

TCACTACTACAAT-AAAACTAAAAAAG ATTCGAACCTCACTTAGTCCCAAT Majority 
_^ 4 ™ 430 440 .450 

T C A r. ^ 1 t r ^ ^ i ^ 1 ^ ^ n ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ — ^ ^ ^ J. 1^ ^ G ~^ ^ C C ^ ^ ^ 2 603_aA2.seq 



^CAC^AC^C^?A^Ar? AA MMSr TG ' CAAC ^ ASrXalOeq 
?r A ST A SI A e AA ^ AAA SI^ AAAAACATTGGAA C HTC A GTTA C T C C C A A T SlS^.seq 
TCACTACTACAATAAAACTAAAAAAGATTCCAA c lM»HM BMMU r. tEI C C A A t-ikTii ™ 

tc»c.tactacaaiaaaactaaaaaacaitcca5cT?U^ct?ccoJS K C 



WO 2006/078318

Weighted residue weight table






c r T T TWffl J'lfalk^^ 

■ ~ 1 

460 470 480 



PCT/US2005/027239

Page 2



TCCTTCCCTAAAAAG A T A T A C 



Majority 



490 



500 






GCACT T A G A T T CAAAATACCA TAA GCAAGTATAAAAr.CAGCTA A A A r. A T r* teJority 

5 ?° S40 ^ 

~ l_ 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



— i — 

510 
l 



GCAGTTAGATTC A A AATACCATAAGCAAPT A x* AAA ^^ A ^^"^ AA *^^ A ^* G 3603_aA2.se,, 

GCAGTTAGAXXCAAAA T A CCATAACCAAGTATAAAACCACCT A A A A C A .X C J^lfo^aT^seq^ 

Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq




TGTCCGAAAATG 
TGTCCGAA A ATC 
TGTCCGAAAATG 
TGTCCGAA AATG 
TGTCGGAAAATG 



AACCCCTAG 
AACCCCTAG 
AACCCCTAG 
AACCCCTAG 
AACCCCTAG 



G T A A 

G T A A 

G T A A 

G T A A 

G T A A 



ATACGAGATAACCCAATTAAAAAAA 
ATACGAGATAACCCAATTAAAAAAA 
ATACGAGATAACCCAATTAAAAAAA 
ATACGAGATAACCCAATTAAAAAAA 
ATACGAGATAACCCAATTAAAAAAA 



XGJLG C A A A C C CA AACTACCTTCGC AC AACACTTTf!r a T A T A 



— r 
610 



620 



TT 

630 



— 1 — 

640 
1 



CTCTTAGGC Majority 

.650 



-TGAGCAAACCCAAAG T"A CCTTGGC A CA AC AG TTTC C 4 T A T A r YT^FTTTTT 
TCAGCAAACCCAAA.TACCTTGGCACAAC , J 

TGAGXAAACCCAA AH T AC CTTC6C A C A AC AGTTTCC AT AT AC Sin C c? 
T C AG C AAACCCAA AQ T A C C T T G g|| ACA AC AGTTTCC ATAT AC TC T T A G G r 
TGAGCAAACCCAAAGTACCTTGGCACAACAGTTT " 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq




a>aJa«act«o A4 J * AAA ^ ataaiactccc " atat ^**tgttcc 

A^ A ^^^Ac^G^ AA ^ A ^ A ^^ AA ^ AATAcTcccAAATATcATAAA TGTTcc 

Ai A ^AG^c?G> AA ? A *■ I AA I AATA CTCCCAAATATCATAAATGTTCC 
A T A T A G T'A £ T'G C A A' T jv A a A t ^ ^ t A ^^''^^''^AATATCATAAAT G'T X C C 
ATATAGTAC.TGCAATAAAATAATAATACTCCCAAATATCATAAATGTTCC 

A A.C G A A T A 




ca£cga^gccc£c?^ 

CATC'GACTCCCCArTfrM 
CATCGAC?GCrr*r£r?r^ 

catcgagtgcccactgggaaacgaatagccacctgcaaatactaaatgcg 



CATCGA. GTGCCCACTGGG. AAACGA.ATAGCCA 



C C T 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq

Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq



— — +J — ™ • 

GCAAATACTAAATGGG h36b_aA2 .seq 



TTAAA.G.TTCGTCTTACTCTTTGAAAAATAAGTTTTA A A C A A A G T A T A r. A T MaJorlty 

760 770 • 780 . . 790 Z> 



£ ^ A A A G fit X'C X.X A C Q X C II 111 A A A A T A tf? XTT it AG A A A t V, X A C „ X JV^ 

s^aa^^*^*^?^ 

"AV-Jx.c^ 

A.T ACCAG ACATAATA G C A f T T A C TC C C A TA A A T C T A C C T T a a 

' ■ • • ^"^^^^ fc ^""^^^^*^w^»^w»« J .... ^.-MaBaz;,^,^^ _ . 



810 

JL. 



820 
i 



830 
l 



T 

840 
— t 



G G A T- A C C A Uajorlty 
850 

L. 



A^CCA^AX^AXA^t^ 
AXAC^G^AXAAXA^C^ 

* t * V * w A X A A T«A GCATTTACTGCGATAAATPTAP r*v t r> a r»r> _ a . .. ,,~ _ ^ 

ATACCAGA GAT A- ATA GCATTTACTGC G AT A *A A i C T A G fc T T G A GG AT AC C A 'h^eD^a^^seq^ 



CTTCTTAAGGTA A.C ACAAAGTCACGCTCATA ATCGCAATAGCTA TC T C C 




CTTCTTAAGGTAACAGAAACTGACGC 
CT'TCTTAAGGTAACA GAAAGTGACGC 
CTTCTTAAGGTAACAGAAACTGACGC 
CTTCTTAAGGTAACAGAAACTGACGC 
CTTCTTAAGGTAACAGAAACTGACGC 



TCATAATCGCAATAGCTATCTGGC 
TCATAATCGCAATAGCTATCTGGC 
TCATAATCGCAATAGCTATCTGGC 
T C A TA ATCGCAATAGCTATCTGGC 
TCATAATCGCAATAGCTATCTGGC 



Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



TTA 
T T A 
TTA 
TTA 
TTA 



CAGTATTACC 
CAGTATTACC 
CAGTATTACC 
CAGTATTACC 
CAGTATTACC 



AATCACAGTGATTAACTTGAAAAATCTTG 
AATCACAGTGATTAACTTGAAAAATCTTG 
AATCACAGTGATTAACTTGAAAAATCTTG 
AATCACAGTGATTAACTTGAAAAATCTTG 
AATCACAGTGATTAACTTGAAAAATCTTG 



TAGAAAGA 
TAGAAAGA 
TAGAAAGA 
TAGAAAGA 
TAGAAAGA 



Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



TTTGGCAACTG TCCTCTAACACTT T CTTGA.A 



960       970       980



TTTCGCAACTCTCCTCTAACA C T TTCTTGAATA 
TTTCGCAACTGTCCTCTAACACTTTCTTGAATA 
TTT. GGCAACTGTCCTCTAACACTTTCTTGAATA 
TTTGGCAACTGTCCTCTAACACTTTCTTGAATA 
TTTGGGAACTGTC. CTCTAACACTTTCTTGAATA 



IAiT TTGGT CAAATGAAAT Majority 

1000 



— r 

990 
— i 



GTTTGCTCA AATGAAAT 2603_al2 seq 
GTTTGGTCAAAT G A A A T 18rs21__ai2.seq 
GTTTGCTCA A A T gQ A A T 515_ai2.seq 
GTTTGCTCA AATGAAAT c|blll_al2 seq 
GTTTGCTCA AATGAAAT h36b_at2.seq 







T A C A G 
T A C A G 
TAC AG 
TACAG 
T A C A G 



T 
T 
T 
T 
T 



G 
G 
G 
G 
G 



T G A AAAATAAGA Majority 

1050 



T 
T 
T 
T 
T 



C 
C 

c 
c 
c 



G 
G 
G 
G 
G 



G G C 

G G C 

G G C 

G G C 

G G C 



C 
C 
C 
C 
C 



A 

A 
A 
A 
A 



ATATTTGA 
ATATTTGA 
ATATTTGA 
ATATTTGA 
ATATTTGA 



T G 
T G 
T G 
T G 
T G 



A 
A 
A 
A 
A 



C C 
C C 

cc 
c c 
c c 



A 
A 
A 
A 
A 



A 

A 
A 
A 
A 



T 
T 
T 
T 
T 



C 
C 
C 
C 
C 



c 
c 
c 
c 
c 



T A A 
T A A 
T A A 
T A A 
TA A 



A 
A 
A 
A 
A 



C 
C 
C 
C 

c 



T 
T 
T 
T 
T 



G A A 
G A A 
G A A 
G A A 
G A A 



AAATAAGA 
AAATAAGA 
AAATAAGA 
AAATAAGA 
AAATAAGA 



T A A TAGC 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



AATAAATG C T T G 
* 



A AT AACTTTACTA TTTTCACCACATAACATT Majority 



T A A.T A G C A A 
A G T C T T T T T 



G C T T G A 
G C T T G A 
G C T T G A 
G C T T G A 
GCTTG A 



TTTACTATTTTG 
T T T A C T AT T T T G 
TTTACTATTTTG 
TTTACTATTTTG 
TTTACTATTTTG 



ATAACATT 2603_ai2.seq
ATAACATT 18rs21_ai2.seq
ATAACATT 515_ai2.seq
ATAACATT cjb111_ai2.seq
ATAACATT h36b_ai2.seq



A T A T C 
* 



TTTCT A AT A T T G G C A 



1110 



A 



A C 



1120 

L 



1130 
i 



A A G C C A C G T A.A G T T A G A T 

' : : r~ 



AGTC-TTTTTATATCTTTCTAA T A TTGGCAAAC 
AGTCTTTTTATATCTTTC.TAATATTGGCAAAC 
AGTCTTTTTAT.ATCTTT.CTAATATTGGCAAAC 
AGTCTTTTTATATCTTTCTAATATTGGCAAAC 
AGTCTTTTTATATCTTTCTAAT A TTGGCAAAC 



i 

1140 
, f 



Majority 



1150 
— 1~ 



AAGCCACGTAAGT TA GAT 
AAGCCACGTAAG T T A G A T 
AAG. CCACGTA AGTT.AGAT 
AAGCCACGTA AGTTAGAT 
AAGCCACGTA AGTTAGAT 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



ACAAAACAATCCAAATTAAAATTCCCTCAAC f, ATATTAAATGGAATAACr. Majority 

1160 »™ 1180 1190 . 7^0 



AGAAAACAATCGAAATTAAAATTCC 
AGAAAACAATCGAAATTAAAATTCC 
AGAAAACAATCGAAATTAAAATTCC 
AGAAAACAATCGAAATTAAAATTCC 
AGAAAACAATCGAAATTAAAATTCC 



CTCAACGATATTAAATGGAATAACC 
CTCAACGATATTAAATGGAATAACC 
CTCAACGATATTAAATGGAATAACC 
CTCAACGATATTAAATGGAATAACC 
CTCAACGATATTAAATGGAATAACC 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



ATTGTTAAAAGGTAATTCCCTACACCAATAA ATGTTCTGATATCAAAGTT Majority 

1210 1220 123fi 1240 ~ ^250 



ATTG T ; T*A' A A A t C TA ATT G*£ TTA r A *r r* *^"-r ^ ^ G T T C\T G A T" A TCAAAGTT IS^tali^eq 
ATT *G* T T A A A A G G %~A A T T G' C C T.A C A C G A A T A AA TGTT C T G* A TP .A T CAA-A G T.T S2£^ 




C T A C C A Majority 






AGCAAATA T A G-C.A T A C A A A C G A AT C G C AAA G A C A T A G TT.C AGAGCTACCA fcKb.lal^seq 6 '' 

Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



TTTGC ACTCTCTCT ATT 







TTTGCACTCTCTCTATTTTTCCACAAAATAGCG AAACTTGCTAAA 
t T -r r* A G ^ G ^ CTCTATTTTTCC AG A A AATA GCGAAACTTG CTAAA 

TTTGCACTCTCTCTATTTTTCCAGAAAATAGCGAAACTTGCTAAA 
TTTGCACTCTCTCTATTTTTCCAGAAAATACCC AACtJc JaJI 
TTTCCACTCTCTCTATTTTTCCAGAAAATAGCGAAACTTGCTAAA 



A A T A A 

A A T A A 

A A T A A 

A A T A A 

A A T A A 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq




A Gr^ A r A rrM^ A 3 ATTCATCGGTAAACC ^ ATA 
AGC^AGAr^ AA rr A ^ A ^^ ATCGG ^ 

AGCTAGACCAACCATATTCATCGGTAAACCAA T A AACGTTTCTGG ACCAC 
G A 




2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq

Majority 



G A T T AG C 
G A T T A G C 
C A T T A G C 
G A T T AG C 
G A T T A G C 



AAGTATAACTTTTAAAAGTGATCTTAATAAGAGTA 
AA ^I A ^ AAC?TTTAAAAGTGATCTTAATAA < ; AGTACACCATAA 

A^c?A^^r^^ AAAAGTGATCTTAATAAGACTAC ^ 

AAG^A?AAC?^? AA t A r^ A ^^ cJbTl^a^seq 
AAGT.ATAACTTT.TAAA AG TGATCTTAATAAGAG TACACCATAA h36b_al2.^eq 



15O0 
j. 

CACCATAA 2603_al2.seq 

18rs21__ai2.seq 
515_al2.seq 



CTTCATTTCAAATCAA ATA A 




Majority 



CTT GATTTC.AAAT 
CTTGATTTCAAAT 
CTTGATTTCAAAT 
CTTGATTTCAAAT 
CTTGATTTCAAAT 



1550 

A T C G G a'^G IT? G A 2603_ai2,seq 



CAAATAAAATAAAA G=C A A C T A A C 

P A A I T 1 i i'i t i ^ i G w" A ACT MCATCGGAAGGATTGi IS^^lSleq 
CAAATAAAATAAAA GH A A C T A A C A T C G G A. A G G A T T G A 515 ai2 sea 

^ AAA J AAAATAAAAGCAACTAACATCGGAAGGA TTGA cjb7ilji2?seq 
CAAATAAAATAAAAGCAACTAACATCGGAA G.G A T T G A h36b_jii2 seq 




Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



• TCATCAATAC A. A A A G A T A A GG C 
TCATCAATACAAAAGATAAGGC 
TCATCAATAC A AAA G A T A A G G C 

TCATCAATACAAAAGATAAGGC 
TCATCAATACAAAAGATAAGGC 



AGAAAGAA 
AGAAAGAA 
AGAAAGAA 
AGAAAGAA 
AGAAAGAA 



TGGCGATTGTCACCATTTTA 
TGGCGATTGT C.A CCATTTTA 
TGGCGATTGTCACCATTTTA 
TGGCGATTGTCACCATTTTA 
TGGCGATTGTCACCATTTTA 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



CGTGTATTTGTCATAAAAAAATTC 



i 

1660 



1670 



CTCCA ATTTAAATAAAT.TGAAAGAAG Majority 



CGTGTATTTGTCATAAAAAAATTC 
C G T G T A T TT G T C A T A A A A A A A T f C 
C G T G T ATT.T.G T CAT A A AA A A A f T C 
C C t G T A T T T G T C A T A A A AAA AT T <T 
C G T G T A T T T.GT C A T A A A A A A A T T C 



1680 
i 



GTCCAATTTAAAT 
C T C C.A A T T T AAA T 
C'TCC A ATT T A A A T 
CTCCAATTT A A AT 
C T C C A A T T T.A A A T 



"T 

1690 
i 



1700 

— ' ~ L» 

A A AiTfG A AAG.A AG 2603.Lai2.seq 
A A A T t G A A A G A A G 18rs2i_.al2.Seq 
A A A T T G A A A G A A G- 515_al2.seq 
A:A A TT G A A . A G A A G cJbUl_al2.seq 
A A A T T G A A A G A A G h36U_ai2.seq 



C T.C C A A A G G t A A G C G T A T G T A C G C 

" . T~! - : — : T ■ ■ 



C A A A A AA : CCTT.T. 



C TjC TTCT.CCCATC Majority 






CTCCAAAGGTAAG C'G T A T G T A C-G CGAAAAAA 

CTCCAAAGGTAAGCGTAT.GTACGCGAAAAAA 
C T C C A A A G G T A A G C G T AEG T A C G.C'G A AA AAA 
C T C. CAAA.GtJTAAGCGTATGTACGCGAAAAAA 
C T C C A A A G G T A A G.C G~t AT GT A C'G CGAAAAAA 



CCTTTGTCTTCTCCCA T. C *2603ja!2-seq 
CCTTTGTCTTCTC CC ATC 18rs21_al2.seq 
CCTTTGTCTTCTCCCATC 515_al2.seq 
CCTTTCTCT.T'CTCCCATC cjbllljai2.seq 
CCTTTGTC.TTCTCCCAT C h36b_ai2.seq 



C AG ACTTT AC TCTCCCTTGTGC AATCtCACCACATCA (1 C. T T TCCCTC 



1760 
■ 



1770 
« 



CAGACTTTACTGTCGGTTGTGG 
CAGACTTTACTGTCGGTTGTGG 
CAGACTTTACTGTCGGTTGTGG 
CAGACTTTACTGTCGGTTGTGG 
CAGACTTTACT G T CGGTTGTGG 



1780 



AATCTCACCACATCA 
AATCTC ACCACATCA 
AATCTCACCACATCA 
AATCTCACCACATCA 
AATCTCACCACATCA 



— r 

1790 
— I 



CCG Majority 



1800 
— i— 



GCTTTCGCTCGCG 
CCTTTCGCTCGC, G 
GCTTTCGCTCGCG 
GCTTTCGCTCGCG 
GCTTTCGCTCGCG 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



CACTGATGCTTCACAACTGACAAATAAGTTG 



r^rJr!^ G , C n CTGACAAATAACTTGG AAGCGATTACCCC 



CAAGCCATTACCGCCGGTC 2603_al2.seq 



GACTGATGCTTCACAA 
GACTGATGCTTCAC 
GACTGATGCTTCAC 



C T G A C A A 



AACTGACAA 



A T 
A T 



C G G T C 18rs2I_.ai2.seq 



AAGTTGGAAGCGATTACCGCCGGTC 5I5_al2 seq 
AAGTTGGAAGCGATTACCGCCCGTC clfoll! a!2 «5«i 
AACTGACAAATAAGTTGGAAGCGATTACCGCCGGTC h^olaTz!^ 






GGG^AATTACACCCTGCCCTCAACACACCTATAGC A TAACAAAAAAAACTT Majority 

1860 » 8 ™ 1880 1890 



GGCAATTACACCCTGCCCTGAAGACACCTATAGCAT A S15_ai2.seq 

CCAATTGCAACTTTTT TAATTA CTAA T TACTAGTAnTGATTAAAAArr.» T „ aJorlty 

1910 1920 1930 1940 _ ^0 



'CAATTGCAAG^T^^AtJacJaA^A^ 
CCAATTCCAAGTTTTTTAATTACTA AT I AC T A C I A G T C J ^ A ^ A aJ C a£ f h7T ?T 
GCAATT-GCAAGTTTTTTAATTACTAATTAGTAG^ACTGA.TT'AAAAATCAT MSb^aiTzlseq 6 ^ 






• A XT A AT A CCA A ATTACTATACTGTATCGTTTC T T T C A G A T T T r r T * - r . >nrt . J 
ATTAATACCAAATTACTATACTGTATCGTTTCTTTCAf JJJ? ^IrH 2603_ai2.seq 

ATTAATACCAAATTACTATACTGT U J I It l^^^ 

ATTAATACCAAATTiClATACTBTATCCTTTC^TTriPiJJISSTtllll Sl5_ al 2.seq 
A ^ ^ " A ^ ^ ^ ^ A aaT ^ a £ t ATAC TG. TATCGTTTCTT T C A'CATTTGCTATT JT ^eb^aT^lseq 6 ^ 

TAGTTTTTCTTA AAA AG ATA A A C A A A ATTCCC A A A A TA A T A 

2010 2020 2030 . 2040 



TAGTTTTTCTTAAAAAGATAAACAA 
TAGTTTTTCTTAAAAAGATAAACAA 
T A G T'T T T'T CTTAAAAAGATAAACAA 
TAGTTTTTCTTAAAAAGATAAACAA 
TAGTTTTTCTTAAAAAGATAAACAA 



C A A C C A A G A Majority 

205O 



A A T T C C C 
A A T T C C C 
A A T .T C C C 
A A.TTCC C 
A A T T CCC 



AAAATAATACAACCAAGA 
A A A A T A AT ACAACCAAGA 
AAAATAATACAACCAAGA 
AAAATAATACAACCAAGA 
AAAATAATACAACCAAGA 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq






ATT.CTCAG TCC T C CA C C A A T A ATCATTCCTGT T T T A G G A A G A A A T G A T T ? Majority 

2100 



i. 

2060 



T 



ATTGTCAGTCCTCCACC 
ATTGTCAGTCCTCCACC 
ATTGTCAGTCCTCCACC 
ATTGTCAGTCCTCCACC 
ATTGTCAGTCCTCCHCC 



2070 
i 



i 

2080 
i, 



2090 



AATAATCATTCCTG TT TTAGGAAGAAATCATTG 2603 a!2 sea 

AATAATCATTCCTGTTTTAGG A'A GAAATGATTG 18rSf a iT^a 

^t 1 A I, C ! TTCCTGTTTTACGAAGAAATCA "G llTlklea^ 

Mt AA ^! TTCCTGTTTTAGGAAGAAATC ATT.G cJbTll_ai? S eq 

AATAATCATTCCTGTTTTAGGAAGAAATGATTG h36bjii2.s^ 



TGGAAAAAGCGGT T G T C ATCCT T TAG 

2110 2120 



CATTTGTT G GT.G GAG 







TCTT Majority 



TGC.AAAAA C. C G G T TG TG A TGGTTTAG 
T G G A A A A A G. C G G f T G t G A T C G T T T A G 
TG G A A A A A G CGGTTGT'G A T G G T TT A G 
T G G A A A A A G C G G T T G T G A f G G t T T A G 
T G G A A A A A G C G CT-.T G T G A T GG.T T T A G 



X TTCG 



T C T AC f T C 



GATTTGTTGGTG^AGG AGTT 
G A T T-.T GTT'GGTGG AG 6 AGTT 
CATTTGTT C G T G.G AGG ACT T 
G A T T T G T TG G/T G G A G GAG T T 
G ATT TG TTC GT.G G A GG A G T.T 

* * 

GTTTT A TT A G C A AC T,AC AG C 







T T T C" GTTTTCTAC CT CTACTTCCTGT 
TTTCGTTTTCTACCTCTACTTCCTCT 
T TT C G T T T T C T A CCTCTACTTCCf GT 
T T TCCTTTTCTAC C.T C T ACTTCCTGT 
T T.T C G T T T T C T A C C T C T A C T T C C T G T 



G T TT TATTAGCAAC T. A C A G C 
GTTTTATTAG CAACTACAGC 
GTTT.TATTAGCAACTACAGC 
CTTTTATTAGCAACTACAGC 
GTTTTATTAGCAACTACAGC 



2150 

JL. 

TCTT 2603_ai2.seq
TCTT 18rs21_ai2.seq
TCTT 515_ai2.seq
TCTT cjb111_ai2.seq
TCTT h36b_ai2.seq

A AC T Majority 
2200 

AACT 2603_ai2.seq
AACT 18rs21_ai2.seq
AACT 515_ai2.seq
AACT cjb111_ai2.seq
AACT h36b_ai2.seq






AC AGC ATCCTTCATAGA TA T A C CGTAACCA GTTAGTGCTT T~T C C T TC T f. fi Majority 

2220 2230 2240 2250 



ACACCATCCTTCATAGATATACCGTAACCAGTTAG TGCT TTT GCTTCTCf amw 

acagcatccttcatacatatacggtaa c*c agttagtgcttttgc tt r t r r ^f 2 ;f q 

ACAGCATCCTTCATAGATATACGGTAACCA T I nH C f f 12 '^ 

ACAGCATCCT TC ATAGATATACGGTAACCAG an I \ ^ 

acagcatccttcatagatatacggt.aaccag"ac?g""^ 
AACCTC AATTTCTCCCTTAT Majority






AAAAATATAO ^ r ^ r ^ ^ M , ' ' T ? ° " " C ° T C A A ? ^ T C T C C C T T A ^ 2603_.al2.seq 
J^JIJ L^T I! ^. CCTTCAACCTCAATTTC ^ 18rs21__al2~L, 

AAAAA J A ^ A ^ GGTAATAAACCTTC A A C C T C A A TTTC T C C C T T A T 515 a!2 sea 

AAAAAT^ C c? ^"AATAAACCTTCAACCTCAATTTCTCCCTTAT cJbU^seq 

AAAAATATACTTACCAGCTAATAAACCTTCAACCTCAATTTCTCCCTTAT h36b_a72.seq 

CATCAGTTACTAA TGAA GTAATCCCATCT T G A TCGGTCCTA A ATCGTCCA Majority 



2310      2320      2330      2340      2350






C A T C A G T'T A CTAATPA^rTA^^^^^^^^^^ ^ ^ 2603_ai2.seq 

CA?CAG^ A r^ AA ^r AA r^ AA ^^ ATCTTG IS^aiCeq 
? A Jr A r^ A ^!!iy AGTAATCCCATCTTGATCGCTCGTAAATC ^ 515 ai2.seq 

CATCAGTTACTAATGAArT c JbTl l^at^seq 

CATCAGTTACTA AT G AAGTAATGCCATCTTGATCGGTCGTAAATCGTCCA h36b_ai2.seq 

TTTTTAAAGCGAAC.TGGCTGATTCTGG T TATCGTATAATACAAATATTAC Majority 
2380 2390 2400 

T^T^AArcCA^ 2603.al2.seq 
???^ AAAG rr A ^ C J GC ^ GATTCTGGTTATCGTATAATA ^ A 18rs2l_ai2.seq 
TTTTTAAArrriirTr^rT^^ ^ 515_al2.seq 

TTTTIAAAGCGAACTGGCTGATTCTGGTTATCGTATAATACAAATATTAC .h36b_aT2.seq 

TCCGGATAGCC TTTTCTTT ATCTTTCCTTCTTTTGTATATTTAATAAG.TT Majority 

. 24 , 10 24 . 2Q 2430 2440 2450 

TCCGGATAGCCTTTTCTTTATCTTTCCTTCTTTTGTATAtTTAATAAGTT yen**!* ^ 

T C CHG ATAGCCTT.TTCTTTATCTTTCCTTCTTTTGTATATTTAATAAGTT 1K^^ 

TCCCGATAGCCTTTTCTTTATCTTTCCTTCTtTiG 'ttS^ 

T " CG-ATAGC cQt TTC TT'TA-TCTTTCC T'T C T T T T gQ A T A T T*T AATAAGTT h36blal2 s^T 

TTAATCCCCCTGTTTCAACTT TTCGCTTACCATTTATCTCTAATTGATTT Majority 

- 24 ^° 24 7Q ' 2480 2 490 2500 ' 

TTAATCGGCCTGTTTCAACTTTTCGCTTAGGA T T TATCTG.TA ALTG ATTT 2603 a!2 sea 
T T a A t r ^ r r* 9,]£ G ^^^ GA *^^^^^^***' ! ^"* V AGGAT T"T A'T CTGTAATTGATTT 18rs21 al2Vseq 
TTAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTAfCTGTAATTGATTt 515 a 12 sea 
TTAATCGGCCTGTTTCAACTTTTCGfclTAGGATTTATCTC T TTT cjbUl alTsea 

TTAATCGGCCT.GTTTCAACTTTTCGCTTAGGATTTATCTGTAATTG ATT T h^laT^seT 

GATAACTTATCATCTGGTAA T T C A A T AT A A A A A G G T A C T A TTCTTGA A A C Majority 



2510      2520      2530      2540      2550



?J J I J JI? I KAUATU * AA '" TACIATU "»* Jl < ; 2603_al2.se,, 

GATAACTTATCATCTGGTAATTCAATATAAAAAGCTACTATTCTTGAAAe 18rs21 ai2 seo 

GATAACTTATCATCTGGTAATTCAATATAAAAAGGTACTATTCTTGAAAC SlTaii iea 

G A T A A C T T A T C A T C T G G T A A T T C A A T A t A A A A A G G T A C T A T T G T t G A A A C clblu at^sea 

GATAACTTATCATCTGGTAATTCAATATAAAAAGGTACTATTCTTGAAAC Hob alt 



GCTTTCATC AGCTTT ATAACCACGACCAAAGTACCAACCAfTTCCCACTG Majority 



2560      2570      2580      2590      2600



G G I I I G A I C A G C T T T A T A A G G A C G A C C A A A G T A C-G A A C C A T T T G C G A G T G 2603__al2.seq 

G CT T. T-G ATC-A GCTTX JUT A A .CCA C P A C C A A A G T A C G A A C C A t T T G G G A G T G ISr^a 2 seo 

G C T T TO ATCA.GCT T T A T A A CCA CGACCAA A C T AC GAACC ATT T G G GAG T-G ll^l i^sea 

GG jn GATCAGCTTTATAA ^AC C A CC A A A C T A C G A A C C A T T C ■ ] " a a^sea 
G C TTTC A T .C AC CXI T A TAAGC AC G A C-C A A A G TA C G A A CCA T T TG GG AG T G 

C T A T C T T T G T C T G A C C A TT AC T A T C AG TAGGAG.AAGTC AAGATA C T C TT A Majority 

2630 * 2640 

J U : _1 



2610 
i 



2620 

JL 



2650 

JL 



ZZ^ZZZZZ-ZZ TGACC ATTAGTATCAGTAGG AG A A G.T C A A G AT A C T C T. T A JJGOjJ.aia.seq 
nl *I ZZ Z Z Z T C T G * C C A T ? A C T A T C A G T A G C A G A A G T C A A G A T A C T C T T A 18r__i _l_!_eq 
rZ A I G IH^ TC I GACCATTACT - ATCACTAGGACA AGTCAACATACTCTTA 515 ali.seq 
CTATCT.TTCTCtGACCATTAGT A T C AGTAGGAGAAGT C.A A GATACTCTTA clbril al? sea 
CTATCTTTGTCTGACC A T T A C t A T C A G T A C G A G A A G T C > A G A T A C T C T T A 

TACTTCTGGTTC AATTCGCTATiCTGTC AT T TGCCTC A ATA A ATCAACTTT Majority 
2 y° ' 2 670 2680 2690 2700 

I A "I"5 CITCAATTCC " ATC ^ CTCAITTG «"<:""AATCA*CTTi 2 603 aI2. Seq 
J« C J^I^ TTCAATTCGCTATCTGTCATTTGGCTCA ATAAATCAACTTT tSrsil a^^eq 
Z ^ "TCAATTCSCI.ATCTCTCATTICCCTCAATA4ATC AACTTT 515 alS.sen 

TACTTCTCCTTCAATTCCCTATCTGTCATTTGGCTCAATAAATCAACTTT cjbllLal^seq 
T A C T T C T C G T T C A A T T C G C T A T C T G T C A T T T G G C T C A A T A A A T C A A C T T T h36b al2.seq 



FIGURE 19E 



CCCCTCCTCTGTAC Majority 



2740      2750



T A A C T TC T C AC T C A C, AG T£ C A .T A,A„A. C...C A T A A G A A A T 

• P C^tW " 5 " 5 ^ji:;;:: > ,c :;n 27 * 3Q 

TAAGTTGTCAGTCACAGTCCATAAACGATAACAAATCCCCTCCTCTGTAG 515 a 12 sea 
Jtt^^y^5 TCACAGTCCATAAACGATAAGAAATCCCCTCCT CTGTAG cjblll alTseq 
TAAGTTGTCAGTCACAGTCCATAAACGATAAGAAATCCCCTCCTCTGTAG h36b^_a72.seq 

TATTr'GGCT CAAGT CCTATCTGTCTCATTGTTAGTT.GATTAGCCCTATCA Majority 



2760      2770      2780      2790      2800






I A IIJ G 55I CAAGTCCTATCTGTGTG ATTGTTAGTTGATT-AGGCGTATCA ISrsil a 1 

TATTTPrrTMirTrrTAT^^^ cJblU^seq 
TATTTGGCTCAAGTCCTATCTGTGTGATTGTTAGTTGATTAGGGGTATCA .h36b_ai2.seq 



CCATTTA CACTGGCT ACCCAAAAAAACCCTAATTGT AC CAATCCTAAAAA Majority 



2810      2820      2830      2840      2850



rrAXTT^r^rxr^rT^^^^^^^ AA 2603_al2 .sea 

J ^ ! AUAAACGCTAATTCTACCAAK 18rs21_al 2 lea 

^^TTTACAC^GG^TACCGAAAAAAACCCTAATTGTACCAATCCTAAAAA 515_al2.seq 

•GCATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAA clblH al2 sea 

GCATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAA Sob Hit !seT 



G C 



AACATAGTAGA 



A G T C C T 



A A 
i — ~ 



A C 



2860 

,- , j 



GCAACATA'GTAGAAGTCC 
GCAACATAGTAGAAGTCC 
GCAACATAGTAGAAGTCC 
GCAACATAGTAGAAGTCC 
GCAACATAGTAGAAGTCC 



2870 
— t 



TTTTTC TA ATCTTT TT- C A TTTTTCATT Majority 



TAA ACTTTTTC 
T A A A C T T'T T T C 
TAAACTTTTTC 
TAAACTTTTTC 
TAAACTTTTTC 



2880 
i 



* 

2890 



TAATCTT T T T C 
TAATCTTTTTC 
T A AT C T T T T T C 
TAATCTTTTTC 
TAATCTTtTTC 



2900 

L_ 



A T T T T T G A T T 
ATTTTTGATT 
ATTTTTGATT 
ATTTTTGATT 
ATTTTTGATT 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



TC CCTTTCTCT.TTCTCTCTTTA A A T T T T C G T T T T A A A T A T A A T A G T A A A G 
29 , 10 . / 2920 2*g30 2940 2950 



Majority 



TC C C T TTCT-C.TTTCTCTQTTT A A ATTTTCGTTTTA AATATA ATAG TAA A G 
TCCCTTTCTCTTTCTCTCTTTAAATTTTCGTTTTAAATATAATAGTAAAG 
TCCC.TTTCTCTTTCTCTCTTTAAATTTTCGTTTTAAATATAATAGTAAAG 
TCCCTTTC KT T T C T C T C T".T.T AAATTTTCGTTTTA AATATAATAGTAAAG 
tcc ctttctHtttctctcttt A AATTTTCGTTTTA AATATA ATAGT A A AG 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



CGACTAATATAAGAATAACTA GGATTGATAACAGCAAATAAACTTTATAC Majority 

2990 3000 

J L. 



2960 
J. 



2970 

JL 



2980 



CGACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAACTTTATAG 
CGACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATAG 
CGACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATAG 
CGACTAATATAAGAATAACTAGGATTGATAAGAGGA'AATAAAGTTTATAG 
CGACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATAG 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



TGTCTTTGCAATTCT T T C 




3010 



ATTAAATACTTCTTT T 

I— : 1 : : 



TGTGTTTGC AATT.C T..T T C 
T G.TG T.T'T G C A A T T CTTTC 
TGTGTTTGC A At TC T t TC 
T G T G T TTGCAA T T C T T T C 
.TGTGtTTG CA A TIC T.T T C 

• ■ 

AT A C'TT GAT T C G A T G C C C 



3020 
i 



CTTTAAC AGG AGG 



T A 



.3030 
i 



3040 
— f 



C Majority 
3050 



ATTAAATAGTT-CTTTTC T.T T A A C A G G AG G 
ATT A. A' A T A G T T C T T t T C T T t A A C A G G A G G 
A T.T A A A T A G T T. C T T T T C TTt AACAGG.AGG 
ATTAAATAGTTC TTTT.C T ft A A C A G G AGG 
A T t A A A T A G T T C T T T T C T T T A A C A G G AG G 



TAC 2603_ai2.seq
TAC 18rs21_ai2.seq
TAC 515_ai2.seq
TAC cjb111_ai2.seq
TAC h36b_ai2.seq



.3060 
— a 



T € TAA C T A G T A A A C C ATCTG.A AT T A A T C G 



A T AC- TTGATTCGATGCCC 
■AT AC TTGATTCGA. TC CCC 
ATACTTGATTCG A"T GCCC 
ATACTT G A T T C*G A T G C C C 
ATACTTGATTCGATGCCC 



3070 
i 



— r 

3080 
i 



3090 



A AT Majority 
3100 



TCT-AACTAGTAA 
T C T A ACT. AGTAA 
T C T A A CTAGTAA 
T C TA ACT AGTAA 
TCTAACTAG TA A 



ACGATGTGA AT T A AT C G 
ACGATGTGAATTAATCG 
ACGATGTGAATTAATCG 
A C G A T G T GAATTAATCG 
ACGATGTGAATTAATCG 



A A T 2603_ai2:seq 

A A T 18rs21_al2.seq 

A A -T 515_ai2.seq 

A A T cjblll„ai2.seq 

A A T h36b_al2.seq . 



A A C G T G T AC ATGTTAGCA 



3110 
— i 



AAGGTGTACATGTTA GCA 
AAGGTGTAC ATGTTAGCA 
AAGGTGTAC ATGTTAGCA 
AAGGTGTACATGTTAGCA 
AAGGTGTACATGTTA GCA 



AAGTCGC.ATAATCCTTACCTT TAACAACC 

: I - : — I 

3120 3130 
«-: i 



A A T 

_ 



Majority 



3140 
— i — 



A.AGTCGCATAAT.CCTTACCTTTAACAACC 
A A G T C G C AT AATCCTTACCTTTAACAACC 
AAGTCGCATAATCCTTACCTT TAACAACC 
AAGTCGCATAATCCTTACCTTTAACAACC 
AAGTCGCATAATCCTTACCTTTAACAACC 



3150 

f— r - 



A A T 

A A T 

A A T 

A A T 

A A T 



2603_al2.seq 

18rs21_al2.seq 

5l5_al2.seq 

cjblll_al2.seq 

h36b_al2.seq 
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Weighted residue weight table 



[Sequence alignment panel, alignment positions approximately 3150 to 3600: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]

FIGURE 19G

[Sequence alignment panel, alignment positions approximately 3600 to 4050: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]

FIGURE 19H

[Sequence alignment panel, alignment positions approximately 4050 to 4500: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]
[Sequence alignment panel, alignment positions approximately 4500 to 4950: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]
[Sequence alignment panel, alignment positions approximately 4950 to 5400: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]
[Sequence alignment panel, alignment positions approximately 5400 to 5850: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]
[Sequence alignment panel, alignment positions approximately 5850 to 6200: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq.]
G tBU CP A T A AT 
T All T TfSHCia.T AB T 



T - 



6220 



C T C T A T T A A C A *C T. G T C A A Majority 

.t i 



T G CGCCAGCT 
fjl G'C C CC AGCT 
CI 

C T G CI 



6230 
JL 



71 

6240 



G.T C AAA A 
G T- C A: A A "A 




G T C T A 
GT.CTA 



6250 



B 



T T A A C A 
TT A A C A 



T G TC A A 
TCTCAA 



ii-i'll t; a 




2603_ai2.seq 
1 8rs2 l_la!2 .seq 
-515_ai2.seq 
cjblll^ai2.seq 
h36b_al2.seq 



~ TGTCAAGTT GT TTG TTTTA 
— , , 



6260 



6217 
6217 
.6173 
6172 
6231 




TGTC'A AH T| 
TGTCA aHt| 
A A G T T| 
A A gB.T 
c A aDt t 



C A A 



6270 
— I 



G C A A A G T T TT T A T C TATTTCT GGTTTT 
: i 1 ■ 



T 



Majority 



6280 
i 




G TT TGTTTTAGCAAAGTTTTTA'TC 
GTTTGTT T TA GCAAAGTTT. TTATC 
T T TjJjTnT^Hc CH A A G T.tW TTATC 

t t tBtBtHg caaagttHttatc 

ttt gQt TTAGCAAAGT TTT tR T C 




6300 
t— 



TGGTTTTT 
T G.G f TTTT 
TGGTTT T^ 
T G G T TT tB 
T G G tR8 T T 



2603_al2.seq 
18rs21_ai2.seq 
515_a£2.seq 
cjblll_al2.seq 

h36b_al2.seq 
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' OT T C A C,,*,^ gfQy* A,A ; g,Q. f gjHftj TCTTCAACAACACCATCTTCC Major, ty 
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions 6310 to 6750.]
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions 6760 to 7200.]
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions 7210 to 7650.]
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions 7660 to 8100.]
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions approximately 8110 to 8550. Legend fragment on this sheet: "Weighted residue weight table."]
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions 8560 to 9000.]
[Figure: nucleotide alignment of 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq against the Majority consensus, alignment positions 9010 to 9450.]
9402 
9310 
9299 
9337 



9452 
9452 
9360 
9349 
9387 



9502 
9502 
9410 
9399 
9437 



9552 
9552 
9460 
9449 
.9487 



9602 
9602 
9510 
9499 
9537 



9652 
9652 
9560 
9549 
9587 



9702 
9702 
9610 
9599 
9637 



9752 
9752 
9660 
9649 
9687 



9802 
9802 
9710 

9699 
9737 



T C T T 



¥t~T 



AC 



'T"C 



G"G €/ { 'C t C'T-T-T' C"T TCTGATA AA C T A T A A T C 
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9460 
— i 



9470 

i 



TCTTTTTATA 
TCTTTTTATA 
TCTTTTTATA 
TCTTTTTATA 
T C T T T T TW T A 



A 
A 
A 
A 



CCTTCGGGC 
CCTTCGGGC 

ccttcgggc 
ccttcgggc 
ccttccggI 



9480 
— I 



G C T G T T T 

G C T G T T T 

G C T G T T T 

G C T G T T T 

G cH G T T T 



-r 

9490 
i 



T C C A 
r 



Majorlty 



CTTCTGATAAAGTATAATC 
CTTCTGATAAAGTATAATC 
CTTCTGATAAAGTATAATC 
CTTCTGATAAAGTATAATC 
CTTCTGATAAAG T__| T A A T C 



9500 
i_ 



T C C A 

T C C A 

T C C A 

T C C A 

T C C A 



2603_ai2.seq 
18rs21_al2.seq 
515_ai2.seq " 
cjblll_ai2.seq 
h36b_a!2.seq 



GGTATGACATTATCAAA ACTAGCTTCACCTCTTAGCTCAGCAGTTACTTT Majority 



9510 



9520 

JL 



9530 



9540 



9550 

_L 



CGTATGAGATTATCAAAAGTACCTTCACCTGTTAGCTCAGCAGTTACTTT 2603 al2 seq 
GGTATGAGATTATCAAAAGTAGCTTC'ACCTGTTA GCTCAGCA GTTACTTT 18rs2i ai2seq 
GGTATGAGATTATCAAAAGTAGCTTCACCTGTTAGCTCAGCAGTTACTTT 5I5_ai2 seq 
G G TAT GAGATTATCAAAAGTAGCTTCACCTGTTAGCTCAGC A GTTACTTT clblil ai2seq 
G G TfflTG AGATTATCAAAAGTAGCTTCACCTGTT aQc T C AgQa GTTACTTT h36b ai2 seq 



TTCTATTTT 



ACTTTCTGGATGACC A G T A G 
i 1 



9560 

L 



TTTTT AAAAC AAAGGTAGCTT Majority 



9570 

JL 



9580 

JL 



9590 



9600 



TTCTATTTTACTTTCTGGATGAGCAGTAGTTTTTAAAACAAA-GGTAGCTT 
TTCTATTTTACTTTCTGGATGACCAGTAGTTTTTAAAACAAAGGTAGCTT 
TTCTATTTTACTTTCTGGATGACCAGTAGTTTTTAAAACAAAGGTAGCTT 
T T C T AT T T T A C T T T C T G G A T CAGCA GTAGTTTTTAAAACAAAGGTAGCTT 
TTCTABtTtHcTTTCT gHBt G|lGl!A G T A gDtTTTA AAAC AAAGGTAGCTT 



2603_al2.seq 

18rs21_aI2.seq 

515_ai2.seq 

cjblll_al2.seq 

h36b_al2.seq 



TTGAAAGTCCTTTCTTCTG 

9610 9620 
— 1 : i__. 



G T C A T C T G 

T — 



T-CTTTTTAACAACTAACTTTC 
"9630 9640 

1— : • i , 



TTGAAAGTGGTTTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT 
TTGAAAGTGGTTTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT 
TTGAAAGTGCTTTGTTCTGGTCATCTCTCTTTTTAACAACTAACTTTCCT 
TTGAAAGTGGTTTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT cJbllLalB.seq 
TTGAAAGTGGTTTGTTCTGCTCATCTGTCTTTTTAACAACTAACTTTCCT h36b_ai2 seq 



C T Majority 

9650 
i— 

2603_ai2.seq 
18rs21_ai2.seq 
515__al2.seq 



TTAGCACCATTTTCCGGTACGGTACTTTCCC C T A A A A C A 



9660 
i 



TTCGTATTA AG Majority 



9670. 
— i 



9680 



i 

9690 



\ 



9700 



TTAGCACCATTTTCCG.GTACGGTACTTTCCCCTAAAACATTG G T A T T A A G 
TTAGCACCATTTTCCGGTACGGTACTTTCCCCTAAAACATTGGTATTAAG 
TTAGCACCATTTTCCGGTACGGTACTTTCGCCTAAAACATTGGTATTAAG 
TTAGCACCATTTTCCGGTACGGTACTTTCCCCTAAAACATTGGTATTAAG 
TTAGCA.CCA-TTTTCC.GGTACGGTA-CTTTCCCCTAAAACATTGGTATTAAG 



2603_lai2 .seq 

18rs21_ai2.seq 

515_al2.seq 

cJblU_ai2.seq 

h36b._al2.seq 



CGGTAT. TTGCGACAAACAAA AAAGA C T T A 

1 



9710 
I 



CGGTATTTGCGACAA 
CGGTATTTGCGACAA 
CGGTATTTGCGACAA 
CGGTATTTGCGACAA 
CGGTATTTGCGACAA 



TT 

9720 
t 



A.CGTCAATATTTTAGAAAATT Majority 



9730 
f 



i 

9740 



9750 
i_ 



ACAAAAAAGACTTAACGTCAATATTTTAGAAAATT 
A C A A A A A AG ACTTAACGTCAATATTTTAGA AAATT 
A C A A A A A A G A C T T A A C G T C A A T A T T T T A G A A A A T T 
ACAAAAAAGACTTAACGTCAATATTTTAGAAAATT 
ACAAAAAAGACTTAACGTCAA TA TTTTAGAAAATT h36b_ai2.seq 



2603_.al2.seq- 
18rs21_ai2.seq 
5I5_.ai2.seq 
cjbl 1 l_a!2.seq 



TTTGCTATT T T_C T C A T T T T ACAAC.TC CT ATTCTCCCGA A ATGTC GTTTCT Majority 



9760 
— I 



9770 

; J — 



T 

9780 



9790 
— i 



9800 
» 



T T T G G T A T T T T C T C A T T T T A C A A C T C C T A T T G IT G C C G A A A T G T C G T. T T C T 
T T T G G T A T T T T C T C A T T T T A C A A C T C C T A T T G T G C C G A A A T G T C G T T T C T 

T T T G G T A T -T T T C T C A T T T T A G A A C T**G C T A T T G T G C C G A AA TGTCGTTTCT 
T T t.G C T A T T T T CTCATTTTA G A AC TCCT ATTGT6CCGAA ATCTCGT.TTCT 
T TTiG GT A T T'T T C J T C A.T J T T A C A A C T G C TAT T G T G C C G A A A T G T .C (?TT T C T 



2603_ai2.seq 

18rs21_.al2.seq 

515_ai2.seq 

bjblll_£l2.seq 
h36b_al2;seq 



A A A T C T A A G A T C AG ATACA.G A A T A 

■ 



— r 

9810 
— ti 



9820 
i 



T.CCTA CAATAT, ACAA ACTA TG ACTT A 
i , 



Majority 



9830 
i 



9840 



9850 
— 1_ 



A AATCTA AC ATC AGAT AC AG A AT A TCCT AG A AT AT ACAAACT ATC ACTT A 

AAATCTAAGATCAGATAC.AGAATATCCTAGAATATACAAACTATCACTTA 
AAATCTAAG'AT.CAGATACA.GAATATC. CTAGAATAT aRa A ACTATC ACTT A 
AAATCTAAGATCAGATACAG A. A TATCCTAGAATATACAAACTATCACTTA 
AAATCTAAGATCAGATACAGAATATCCTAGAATATACAAACTATCACT 



T A 



2603_ai2,seq 

18rs21_.al2.seq 

515_.al2.seq 

cjblll_al2.seq 

h36b_.ai2.seq 



T T ATGAT.ATC A ATAATTTCTTATTATAA GGTATCG AA T T T T 



T 

9860 



9870 
i 



9880 



A A 



TGTTTT T 
r- 



Majorlty 



9890 



9900 
t— 



TTATGATATCA A.T A ATTTCTTATTAT AAGGTATGGAATTTT AATGTTTTT 
TTATGATATCAATAATTTCTTATTATAAGGTATGGAATTTTAATGTTTTT 
TTATGATATCAATAATTTCTTATTATAACGTATGGAATTTTA aB GTTTTT 
TTATGATATCAATAATTTCTTATTATAAG. GTATGGAATTTTAATGTTTTT 
TTATGATATCAATAATTTCTTA.TTATAAGGTATCGAATTTTA AH GTTTTT 



2603_ai2.seq 

I8rs21_al2.*seq 

515_al2.seq 

cjblll_ai2,seq 

h36b_a!2.seq 
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# mm 

metjWvath Weighted residue weight table. 



9852 
9852 
9760 
9749 
9787 



9902 
9902 
9808 
9799 
9837 



9952 
9952 
9856 
9849 
9887 



10000 

10000 

9903 

9897 

9937 



PCT/US2005/027239



[Sequence alignment, continued: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, alignment positions 9901 to 10350.]

WO 2006/078318



[Sequence alignment, continued: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, alignment positions 10351 to 10800; Weighted residue weight table.]

FIGURE 19 W

[Sequence alignment, continued: Majority consensus aligned with 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, alignment positions 10801 to 12600.]
12385 GAAACATATTTC. ATCAGCGTGTTGXATXCTCGATT """,^2^ 
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Afignment R^t O^'O 2006/078318 ein meth3T^th Weighted residue wei^t table. ^PCT/US2005/027239 » 

I nursday, July 29, iwi t»:^i; m 

[Sequence alignment figure, continued: 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, each aligned under a Majority (consensus) row; alignment columns 12610 to 13050. The rows for 18rs21_ai2.seq and 515_ai2.seq, which carry most of the shaded residues, are not legibly recoverable from the scanned page.]

FIGURE 19AB

[Sequence alignment figure, continued: 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, each aligned under a Majority (consensus) row; alignment columns approximately 13060 to 13500. The sequence rows are not legibly recoverable from the scanned page.]

[Sequence alignment figure, continued: 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, each aligned under a Majority (consensus) row; alignment columns approximately 13510 to 13950. The sequence rows are not legibly recoverable from the scanned page.]

[Sequence alignment figure, continued: 2603_ai2.seq, 18rs21_ai2.seq, 515_ai2.seq, cjb111_ai2.seq and h36b_ai2.seq, each aligned under a Majority (consensus) row; alignment columns approximately 13950 to 14260. The sequence rows are not legibly recoverable from the scanned page.]

Decoration: shade (with solid black) residues that differ from the Consensus.

[Sequence alignment figure: 2603_ai2.seq and nem316_ai2.seq, each aligned under a Majority (consensus) row; alignment columns 1 to 650. The column ruler and row indices are scattered across the scanned page and the sequence rows are only partly legible.]
[Sequence alignment figure, continued: 2603_ai2.seq and nem316_ai2.seq, each aligned under a Majority (consensus) row; alignment columns approximately 651 to 1150. The column ruler and row indices are scattered across the scanned page and the sequence rows are only partly legible.]
[Sequence alignment figure, continued: 2603_ai2.seq and nem316_ai2.seq, each aligned under a Majority (consensus) row; alignment columns 1301 to 1950. The scanned rows contain character-level recognition errors and are not reproduced here.]

[Sequence alignment figure, continued: 2603_ai2.seq and nem316_ai2.seq, each aligned under a Majority (consensus) row; alignment columns 1951 to 2600. The scanned rows contain character-level recognition errors and are not reproduced here.]

2610 2620 2630 2640 26SO 

: 1 — , i i i 



2501 ATAACTTATCATCTGGT AflT TCAATATAAAAACGTACTATT G T T G A A A C G 2603_ai2.seq 
2601 ATAACTTATCATCTGGTATTTCAATATAAAAAGGTACT AT TGTTGAAACG ne_t316_.ai2.seq 

CTTTGATCAGCTTTATAAGCACGACCAAAGTACGAACCATTTGGGAGTGC Majority 
1 1 1 1 1- 

2660 2670 2680 2690 2700 

1 i i i t_ 

2551 CTTTGATCAGCTTTATAAGCACGACCAAAGTACGAACCATTTGGGACTGC 2603_al2.seq 

2651 CTTTGATCAGCTTTATAAGCACGACCAAAGTACGAACCATTTGCGAGTGC nesi316_aU2.seq 

TATCTTTGTCTGACCATTAGTATCAGTAGGACAAGTCAACATACTCTTAT Majority 
1 1 1 1 i 

2710 2720 2730 2740 2750 
i i i t_ 1— 

2601 TATCTTTGTCTGACCATTAGTATCAGTAGGAGAAGTCAAGATACTCTTAT 2603_.al2.seq 
2701 TATCTTTGTCTCACCATTAGTATCAGTAGGAGAAGTCAAGATACTCTTAT nem316_al2.seq 

ACTTCTG'CTTCAATTCGCTATCTC T C A TTTGGCTCAATAAATCAACTTTT Majority 
1 1 1 1 1- 

2760 2770 2780 2790 2800 

1 1 i i i— 

2651 ACTTCTGGTTCAATTCGCTATCTGTCATTTGGCTCAATAAATCAACTTTT 2603_al2.seq 

2751 ACTTCTGGTTCAATTCGC TATCTGTCATTT.GGCTC AATAAATCAACTTTT nem3I6_ai2.seq 

AAGTTGTCAGTCACAGTCCATAAACGATAACAAATCCCCTCCTCTGTAGT Majority 
1 1 1 1 r- 

2810 2820 2830 2840 2850 

I l_ : l I 1_ 

2701 AAGTTGTCAGTCACA G.T CCAtAAACGATAAGAAATCCCCTCCTCTGTAGT 2603_al2.seq 

2801 AAGTTGTCACTCACAGTCCATAAACGATAAGAAATCCCCTCCTCTGTAGT ne*i3!6_al2.seq 
ATTTGGCYGAAGTCCTAT C t GTGTGATTGTTAGTTGATTAGCGGTATCAG Majority 

~ — ' I 7 ■ — ,. - T i , ■ ~ '■■ y- 

2860 2870 2880 2890 2900 

• i i : l i 1— . 

2751 ATTTGGCTGAAG.TCCTATCTGTGTGATTGTTAGTTGATTAGGGGTAJCAG 2603_al2 .seq 
2851 ATTTGGCTGAAGTCCTATCTGTGTGATTGTTAGTTGATTAGGGGTATCAG nem316_.al2.seq 

CATTTACACTGCCTACCCAAAAAAACGCTAATTGTACCA AT CCTAAAAAC Majority 
1 1 1 1 : 1~ 

2910 2920 2930 2940 2950 

: ; : 1 ; 1 i i : 1— 

2801 CATTTACACTGGCTACCG AAAAAA AC GCTAATTGTACCAATCCTAAAAAG 2603_al2.seq 
2901 CATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATC CT A A A A A G nem316_ai2.seq 

CAACATAGTAGAAGTCCTAA AC TTTTT'C T A ATCTTTTTC ATTTTTG ATTT Majority 
1 1 1 ; 1 — r* 

2960 2070 2980 2990 3O00 
i i : i * i_ 

2851 CAACATAGTAGAACTCCTAAACTTTTTCTAATCTTTTTCATTTTTGATTT 2603_.al2.seq 
2951 CAACATAGTAGAA.GTC'.CTAAACTTTTTCTAATCTTTTTCATTTTTGATTT nem3I6_ai2.seq 

CCCTTTCTTTTTCTCTCTTTAAATTTTCGTTTTAAATATAATAGTAAAGC Majority
3010 3020 3030 3040 3050
2901 CCCTTTCTTTTTCTCTCTTTAAATTTTCGTTTTAAATATAATAGTAAAGC 2603_ai2.seq
3001 CCCTTTCTTTTTCTCTCTTTAAATTTTCGTTTTAAATATAATAGTAAAGC nem316_ai2.seq

GACTAATATAACAATAACTAGGATTGATAAGAGGAAATAAACTTTATAGT Majority
3060 3070 3080 3090 3100
2951 GACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATACT 2603_ai2.seq
3051 GACTAATATAAGAATAACTAGGATTGATAACAGGAAATAAAGTTTATAGT nem316_ai2.seq

GTGTTTGCAATTCTTTCATTAAATAGTTCTTTTCTTTAACAGGAGGTACA Majority
3110 3120 3130 3140 3150
3001 GTGTTTGCAATTCTTTCATTAAATAGTTCTTTTCTTTAACAGGAGGTACA 2603_ai2.seq
3101 GTGTTTGCAATTCTTTCATTAAATAGTTCTTTTCTTTAACAGGAGGTACA nem316_ai2.seq

TACTTGATTCGATGCCCTCTAACTAGTAAACGATGTGAATTAATCGAATA Majority
3160 3170 3180 3190 3200
3051 TACTTGATTCGATGCCCTCTAACTAGTAAACGATGTGAATTAATCGAATA 2603_ai2.seq
3151 TACTTGATTCGATGCCCTCTAACTAGTAAACGATGTGAATTAATCGAATA nem316_ai2.seq

AGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAATA Majority
3210 3220 3230 3240 3250
3101 AGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAATA 2603_ai2.seq
3201 AGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAATA nem316_ai2.seq
3151
3251 



3260 



3270 



3280 

_I_ 



TTTGATCA ACCTTATAG Majority 
3290 33oo 



3201 
3301 



seq 



3251 
3351 



3301 
3401 



3351 
3451 



GCTAAAACTTCTTTCATATTATGAATATAAAAAATTTT T CCTTTTTTfl An ,^ orlty 
f 3 ™ 33 . z0 3330 33^0 ~^so 

t, A 1 ATTATCAATATAAAAAATTTTTCCTTTTTTAAC nem316_aI2.s 

TTTATCTAAATCTGTA A ATAACTTAGCTTTACGTAAGn CGCGATC A^rxr, 
33 . 60 3 3 , 70 3380 3390 ~^oo 

TGATAACACTATGTCAACT TTTT CCACCAATTCGCA A G G A G O T T r V T T . , MaJorlty 
3 . 4 . 10 *20 3430 3440 ^450 

AGG'TGTCCTGCTCCTTTTTC AAGA A C A CT ACTCGTAGTrC-CCGCATflOfl'T 
34 . 6P 3 1™ 3480 3490 3^0 




3401 
3501 



3451 
3551 



TAATCTCAAGCATCTCGGCGTATTCAGCAATACC T TTTTTTTnTTTTTCA Majority 
_ 35 - 6 ° f£0 3580 ; 

T i i-rf Tr^^** OA TC TGGGCGT'ATTCACCA ATACCTTTTTTTTC'TTTT-TCA Pfirra „ 
TAATCTCAAG CA tQ TGGGCGTATTCAC. CAATACCTTTTTTTTCTTTTTCA 



3501 
3601 




3551 
3651 



3601 
3701 



36S1 
3751 



G.T A T A G G GATCTGATAGGCGGCTTG'GG'TCCAGT GTTrTATTA-TA a ~ „ * ~ L m 
G-XA T AGGGATC.TGATAGGCGGCTTGGG?C^^^^^^ A ?^^J^^ 7 -^ 

TCCTAACTCAAATCCT CTATTA A TCTC TTTAGTATTTAATTTTTr.r.r.TT T 
gggg f670 3680 . 369u ~^o 0 

T^^^^^^^^A^^^^^^^^^^^^^Y^^^™^^^ A-.Gjfll ATTTAATTTTTG GGTTT ^e^T6^.aI2?seq 

, GATTATCAAAGTTAGTTACTTGATTATTAGCTTTAA T ATTATAG.TArr , . 

. 2? 3720 . 37>0 ^ ■ ^ Q 

G A T T AT C A A A.G TTACTTACTTCATTATTACCTTTA a t a t t a *r* a ^ > ^ ^ - ' 

TTTCAAATAAAAGC A.-T.A TCACGTTA T C A A A A « A C C - A AC T A A r. A A C A- A T A G Majority 
_ • 3770 3^6 3^" 3X0 

TATC ACCCCTACATTCATCCATCCATTTAAAAhr.A C CGATTTr.TTA , r.n-r toJorlty 



3810 



3820 



3830 



3840 



3850 

-JL 



3751 
3851 




3890 

A. 



TTTTCTCAAATTTTCCTCCCATTATGATTCAATTf rTTT.fTo ao^^^I ^ 
TTTTCTCAAATTTTCCTCfrATTATrlTTrwilf, CTAACKTK 2603_al2.seq 

1CCTCCCATTATGATTCAATTCCTTTTCTAACACTTG «em316 a!2seq 
3801
3901 



C T A A A j§S3ff ,T TWto |£Bg| j ggdj jggr ATT A UCAAACTAACCAACCA Ifaio^ ~ 

. 39i0 3920 3930 394Q ' 3950 

CTAAACCATTTTTTTGACCTTGACCTTTTATTAACCAAAGTAACCAACCA neaaTe'aiSeq 



3851 
3951 




3901 
4001 



ATAATAACTAAAGATATATAGA aHa CATATCTATAAATCCTGTTTAAATG ne*376_ai2^eq 

ACCTTCTTTTATTAATTTTTCATCAATAG G ACCTTTATAAGGGATACGAT Majority 

4010 40 . 20 4030 4040 ^4050 

-t_ « * _JL 



4030 

-L 



3951 
4051 



4001 
4101 



A C C T T C T T TH A TTAATTTTTCATC AATA|gACCTTTATAACgBaTACG AT ne^lG.aiCeq 
GTCCCC TTACTAAAAGT C T GTGTGTATTCA T C ATAATCCGGGTCCAACTT Majority 
4< y° 40 7° " 4080 4090. 4100 " 

C T C C CC T T A C T A A A A G T C T G T GTGTATTGA||CATAATCGGGGTGC AAGTT ne«m6_ai2?seq 

AATAAGCTTCCATAATCATGTCCAGGAAC AACCAACAAATCTGAAAAGTT Majority 

. 4150 



4110 



4120 



4130 
J. 



4140 



AATAAGCTTGC ATAATCATGTCCAGGAACAACCAACAAATCTGAAAACTT 2603 a!2 sea 
A AT A ACGTTCC ATA ATCATCTC CAGGAACAACCAACAAAT.CTG A A A A G T i 



4051 
4151 




4101 
4201 



A T C G G G T r T A A r r A r x t -r If 3? T G A TC ^ A ^ T TCATATGC'TATCGTTTCTT 2603_al2.seo 

ATCGGGTGTAACGAGTTTTATCTGATGTACTTGAT ATGCTATCGTTTCTT nein316_al2.seq 

TTATGTTTTGAATAtAAAACTTATCTCCTT TTTTTAACTTTTTAAGGTTA Majority 
42 ,'° . ' 42 . 2Q . 4230 4240 4250 

T T AT G T T T T A AT ATAAiA^TTiTr^^ C-TTTTTTTA.AC TTTTTA A GG TTA 2603.ai2.seo 

TTATGTTTTGAATATAAAACTTATCTCCTTTTTTTAACTTTTTAACGTTA nenaiG.ai^eq 

GAAAAC.ACTTCTTTATCTCGAATTCCTG ACTGCGCTGTTATAACGGTATG Majority 



4151 

4251 



4260 



4270 



4280 



4290 



4300 

JL 



4201 
4301 



GAAAAGAGTTCTTTATCTG GAATTCCTG AQ T CCGCTC TT A T A A cHc T ATf 2601 <^ 
GAAAAGAGTTCTTTATCTGGA'ATTCCTG AG TGCGCTGTTATAA C G G T A T G IZflllTse* 

TGTGCTATTTCCTCCAATTGCAAGAGAGG T ACCTTCTAAATCCCCTGCTC Majority 

4310 4320. 4330 4340 . 4350 



T^T^CTAT^r r^rr a A ^ I G G A A C A G AG G * A C C T T C T A A A T G C C C T G C T C 2603^12^ 
TGTGCTATTTCCTCCAATTGGAAGAGAHGTACCTtCTAAATGCCCTGCTC nem316_al2.seq 



4330 

JL 



4340 



C TTTAGATAGAACTTCTTGAC TT GAACCT G CAAATAT A*G GG AGTTTTTGA Majority 

4370 4380 - 4390 4400 



4251 
4351 



4360 
JL 



C T T T A G A TAG A A C T T C T T .C ACTTG A ACC Tg C A A A T A T A O C G A G T T T T T C A 2603 a!2 sea 
C T T T.A G A T AG A AC T T.C It G A C T T G .A. A CCTfi C A A Kit A.TA G GG A G T T TT T G A ^T^^eq 



.4301 
4401 



4351 



C C T A T C TTA G G A AC T G A A A TTG t T C C G A TT T T T T C A C IF T A C, C T €'T A A G A T Majority 
_ / 44 J°: : ,, j . ; • 44 .30 ■ 444Q- . 4450 * : 

C??A????^ GAAG J GAAA i T ^TGCGATTTTT 2603^2.^ 
C C T A T C T T A G C A A C T G A A A T T G ifflc C G A X T T TT T C A C T T A C C T C T A A C A T ne^l6LaiHeq 

ACGGGCGTACTCTGCTACCCC CTTTTGAATTCGTTT f .TT CTCATAAGCAT Majority 
4460 4470 4480 4490 4500 

ti^htf^^^ 2603^12;^ 



4451 ACGGGCGTACTCTGCTACCCCCTnTGAATTCGTTTTTTCTCATAAGGArn^eq 



4401 
4501 



CTTC.AACATGGACATTATTTAAAGAAT CATTATAAGCTTGTGCTACAGTC Majority 
4S . 10 45 ?° 4530 4540 4550 

n 115 * AGATGG AC ATTATTT A A AG AATC A T T A T A A G C T T G T G C T A G A C_l C 2603 ai2.seq 
CTTCAAGATG G"A CATTATTTA A A CAATCATTATAAGCTTGTGCTAGA GTC ne J n37 6 _ai2^e q 
ATACG



ft jG_A T T'C jyj^gEhpj _*jt_T <ji"T G A''€"T"K A' ;1 G T T T T T 
i T i 



4551 



TACCAGCTCTCTCAAA Hajorlty 
•. 4560 4570 4580 4590 4600 

4451 * I * ? ?IJ « A ,T T C A T T T C T T T C T C A C T A A C T T T T T T A C C A C C T C T C T C A A A 2«B ai2.aeq 

ATACGHCGATTGATTTCTTTCTGACTAAGTTTTTTAGCAGCTCTCTCAAA ne_t3l6_ai2.seq 

ATC C TGTGTTT GATTA TTAGATTCTATC GTATAGTA AAAACGTGATACCA Majority 
46 . 1Q 4 6 t 2 0 4630 4640 4650 

ATCCTGTGTTTGATTATTAGATTCTATCGTATAGTAAAAACGTGATACCA neoaT^aiSeq 

CTGGATA CAATAAAA TAGATAGACCTATTAGAAAAAGAATGATAA. AAGGA Majority 

. 46 t 60 - 4570 4680 4690 4700 

J 1 = 1 J ■ . 



4501 
4601 



4551 
4651 



^?rJ^r!^!!!^^ ATACACCTATTAGAAAAAGUT ? ATAU 2603_.ai2.seq 
CTGGATACAATAAAATAGATAGACCTATTAGAAAAAGAATGATAAAAGGA nem316_.ai2.seq 

AGATTTGACTTCTTCTTTTT TTTTGTTT. TTTTGTTGATTTTTTTAGTCTT Majority 



4601 



4 7IQ 4720 4730 4740 4750 
J ■ 1 — 1 J « 



AGATTTCACTT CTTCTTTTT TT T T GT TTT TTT gKI TGATTTTTTTA OTPTT 2603 a!2 sea 
4701 AGATTTGACTTCTTCTTTT TBHBfl GTTT TTTT GT T G A tB TT.TTTAGTCTT Si^iSeq 

CACGTCATCTCCTAGATAA TGGCTCTTGCTTATGATCtAAGAG TACTTCT Majority 

47 f°_ 4770 478Q 4 790 , 4800 

4651 CACGTQATCTCCT AG ATAATGGCTCTTGCTTATGATCTAAGAGTACTTGT 2603 ai2 seq 
4748 C A C C T C A T C T C C T AB A T A A T G G C T C T T C C T T A T C A r C T A A C A G T A C T T C T neirtjTe^aineq 

A C T G A AATACCCTTA G AT C AT A A G C AC-A G CTTTAAC TGTGC TT ATA CATC Majority 



4701 
4798 



4810 4820 4830 4840 4850 



4751 
4848 



* n I £ t A f t A C C CTTA CATCATAAGCACAGCTTTAACTGB|GeTTATACATC2603 al2.seq 
ACTGAAATACCCTTAGATCA T A A G CACAG C T T-T A A C T G TGCTTATAC aQc nea316_ai2.seq 

ATCAAAGACTAGCCTTAA GCTTCCTTTGATTGGCGTTT.TTTCATGATAAC Maforltv . 
48 , 6Q - 4870. 4880 " - 4890 4900 

a I n'tt * r*rZ^ZZ TTAAGCrTCC T _i T G A T T C G C G TflT T T. T C ATGATA A.C 2603_.al2.seq 
A T C A A A G A C T A G C C T T A A G C T T C CTflT G A T T gEJc GTTTTTTCATG aQa A C. ne_316_ai2^eq 

TACTGCTCCAAGCATAATGC TTAAACCAATAATTCTGAAAAGAATTGTAC Maforitv 
* 49 1 ^ 4920 [ 493Q 4940 4950 

TJ> I!" GCTCCAAGCATAATGCTTAAA CCAATAATTGTG AAAAGAATTGTAC 2603 ai2.seq 
4898 TACT-GCTCCAACCATAATGCTTAAACCAATAATTGTGAAAAGA-ATTGTAC nemsle^a 12 . seq 

CAATACCACCTGTTTGTGG GATTCT.TACTTTTTTGTTTTGTACTTCTTTC Majority 

. 49 t 6Q 4970 4980 4990 5000 

4851 CAATACCACCTGTTTCTGGGA T_TG TTA CflT TTT. T|Jt T T T|_It A cfiB G TH_rI_f 2603 ai2.seq 
4948 CAATACCACCTGTTTCTGGGA tQ CTTAC'TT TT TTGTTT T #B A cTt C nens76_*i2.seq 

CCATCTTTTTTTACAGCTTT TTTCTTATCTCCCTTGTCACTTITAG C.CC C Majority 

50 . 10 - 5020 5030 5040 5050 



i™l G C A T C T T 1 1 iBBBU^UoJ^ A-G 1 aH^BB C gJgT.C Afl tBt T a|8 g|| C C 2603 ai2. seq 

4998 GCATCT T TT T T. TAG A 61!^^ .r|.|B A^j_frS C cEft G T C A^T^l A^ CXCC nemsTe aiSq 

TTTTCTGTA TG A T G T. T T G-A -T T/ T ACT.TC 'A. A -A-.C TTT AT A TT -A.CC T GC C A A TT Majority 

-5Q60 / , ^ : SQ7 P ■ „ SOJ 8 ? 5090 • 5100 

ft* . TlflT^HTGT Ai G aBBw A _f T|g A C Tflc AAAflT T|B A T'A T T AC C T GCC A A«T 5603 al2.seq 
.5048 TT tUTFtBt AT.G A TgT|_I_0 A H T TACTTC AAAG TTT A|l A tQ<A C C T gQc A A^f ne*3i6_al2^ 


TCGCATATCCTGCTCGTCC T T G TCTTTCTTCCACCf T.CT A AC TGCCTTTT Majority 

. s »° 5120 5130 : S14Q ' S150 

~ ~ ~ ~- GHG cBH GTG TTTCTT C_EJ A G GTTGTAAG T0G CBtIIt" 2G03_.al2.seq 



5098 T C G C A T-Afl C CT'GCT'GGTGGTT G0 G-TTTCTTCC aB CD Tfl T A A G T G C C T TtB nem3T6_ai_!seq 

TCCAGACCTGTAATTTCAAA TTCACCTTGGTCCTTTGA-GG.TGTAT.TTA AT Ma tori tv 

51 t 60 5170 5180 5190 5200 



5045 Q CB 
5148 T C C A 



_G A C CTC T A AQjT TCAAATTG AC C T T G__| T C G T T T G aKI G T G T aWW T A A T 2603_ai2,seq 
B A C COG T A A T T T C A A A T T G A C cMMfcTH CGTtBg A^HtSSB A TTTAA_j ne«316_al2.seq 
5210




5095 --9- C JE™ T AiH c 

5198 BHcEfF T A T C A G C C T T T T T A T C T G 



CATTCATATGCTTTGTTAGCTT Majority 

5250 
A G COT 



5220 5230 5240 

T T A T C T G T JT A T C C Afa; T C A T AB G C Tg T 

HaBccattcatat GHfr 




2603__aI2.seq 
GTTAGCTT nem3I6_ai2.seq 



TAATGAAGC CTTCA TTGTAATCTTTTTGTTTTTTTTTGATACTTTCTTTT Majority 

5260 5270 5280 

5139 jjAATGAAG CC " 



5248 T A A 



A A G G C tjE 

a aQg c||t 



5290 
— I 



5300 
i— 



C ATHGT A A TCtHItTGTTtIBTtTTgAT AfcMflTTCTTTT 2603 al2 seq 
CATTGTAABCTTTTTCTT TTtHI TBB AQ A GtBH ClKC! T ne*37G:ai 2 !seq 



GCTGTTTTTCCTTTTTGGCCT TTTTGTTTTTTTTTAGTCAAGTTGTTATA Majority 

5310 5320 5330 



5340 
t 



5350 

JL 



5^1 G CTG^S'r c I l&llWZZ T T « T T ^taTEBT TEH A G H C A AB TT G T T A T A 2603.ai2.seq 
5298 G-C.T G T T T TQ G G T tHt T G G C C T t||t T G TT T T tJBt TTAGTCAAGT||CTTATA nem3IG_ai2.!seq 

ACCTTTCATAGCTTCATC TAATCCTATTTTAGCAGTTTTTGCACCTGCCT Majority 



5360 
l 



5370 
j. 



5380 

JL 



5390 



5400 



5^48 A a f|ifr G r C T T T T 5fI AEG C T A T T T kHBHI T THg C A G C TfcKHM9 2603„ai2,seq 

5348 A G C TT tBaB|A G C T T C A T cQ A ATGCT A||t TTAGCAGTTT Tt|| C A G C TGGCT nexaSlG^aizTseq 



TT'TTTTGA GCTTTT GCTTCATCTCCTTTACGTGCCAGGTATTTTCCTTCT. MaioMrv 



5410 



S420 

JL 



5430 



5440 



5450 

JL 



5^98 J TIM I ?W A A ?r C nS?T T ^ T WB A |S CTCCTTTAfcl GHGCflAGGTATT Tf! C C t|B C T 2603_ai2 .seq 
5398 T THITTHa G C|Bt TCC tEH Afl C T G C TT T A C G T G G C A||g T A T T T T C C X T C T .K^16_aiHeq 



5448 



GA-CTTTTTCAC.A AC GAA TTTTGCTCCTCCCAACCCTTCAGTTTTAGCTTG Majority 

S470 ' 5480 5490 5500 

-1— i ^mn ' _J_ " 
T C A ""_*; t™*>W|VTB 2603_ai2.seq 
T C A G TH TTAGCTTG neffl3l6_ai2.seq 



5460 

JL 



5333 G A G TTTTTCAC A AN GAATT 



t tB t tB,a CAAGGAA 



5480 

flr g cH ccBgcca aB C Gff 
I tB gctcctgccaagcgt 



A T TT G T T T T C A C A A A T T T C T T T C C A T.GA CTTTC C A CTTTTCGTTCTCTT C Majority 



■ 5S . 10 5520 5530 5540 5550 

5495 ATTTGTTTTHaCAAATTTBItCCATBaGTBIcCACTTTTGGTTCTgBtG ne*a316_ai2~seq 

GGTTGATT GGTGTT GGGTTATTTGAGTTTTTGTTCTTGTTGATGGTTACT Maforitv 

5560 5570 5580 ' 5590 5600 

5430 GGTTHaBtGGTCTTG G G TT AtBBg aHtHtTTGBtBt TgBt|aTGGTTA C T 2603 al2 sec 
5545 GGTTC A T T G G fl G T T G G fl TH A T f T G A G T T T T T cTl G T Tff?T U TflG ifl A cfl S^iSeq 

TTACCATTTTTTAGT.TTTAT T TCTTTTGGAGTGTAGCCGGTAACACGTTC Majority 



5610 

JL 



5480 -TT A C C A T T T THT A gB;T T T A T T 



5595 



,T A 



ACCATTTT TH AHT 



5620 
jl 




t T 



5630 



r . - . r 



5640 



G T AH C e 



5650 

__t . 



W r: — w — ~ - M AACACGTTC 2603_ai2.seq 

TflT Tfl T C G AG TGTAGC C ^HpT AACACGTTC nem316_ai2.seq 



TGCGATGCTGTATGTTTTGG T ATTTTCTAGACCTGTCAATTTACCCGAG.T Majority 

566° 5670 5680 5690 5700 



5645 nSclI AB|G.GH(; TiflCTTT THfte^A T *B T CB'A'B A C C t G TG A aBBBBBHH A G™ n^aTe.alZ^eq 

AT TTT C C A G ; .G T G TTTTC GT TT T G T T'GCT T *T TtC T C A C GTC Q CfTtTACCT Majority 

5710 , 5720 - .5730 ? * " ' 



5720 

•5574 ARTfBCl A G^frHH T tWBHBFt T T gBtB C 
5687 A TBiT T CB AB gB G t HU T 0 G T t T T G T T G C f t 



5740 



<* AffijBB (' (' A 



5750 



_„ A G G T C GE T TBT A C C T 2603_ai2,seq 

BSSfl T G T G AB GEBBPCIB T Tfl A C cQ he«3l6_al2.seq 



TTCTATCTTTTTGTTTGTTT TTCTtGCCCCGTAGTTTTATCTTTAT.CTTT Maloritv 

" 1 a 11 ■ . 



5760 
1 



5770 



5780 



5790 



5800 
t 



^ W^^^l^m3m& GT ™ |YtaiaTE,C TC ^ SS ^ a ^ A ^£^ T ATC TTT'ATCTTT.26D3_al2:seq 
5737 B T G T UbUJ IH G T T TGTTTTTC T T^Of^ AEEDBt AflCTTTABc TflB nem3iG„ al 2!seq 



C.AATGTGTAAGCAACGT TTACATCATTAGTTGTACTTTCTTCTGC C C AG T Majority 

^* 1 ' — 1 — = 1~ * 

5830 5840 * 5850 



S820 

5674 sjyy^-- J GTA a|H CAACGTTTA'C AD C A 
5784 B^^^^^^^^^^i G C.A A cH T tQ A C A T C A 




C C AEEI 2603_al2.seq 
CTGCCC ACT nem3l6lal2.seq 



5724 
5824 



TTTTAG 



5860 5870 
A C tBt T AH T T C 



CGTGT ACCTTGTTGTTGTTTC Majority 

5890 



A { . V i 



T TTG 



5880 
CGTGTAl 



T T T AH T GACTT tRa T T T C T T C G TT|c C T G c|g T Ac"c?^i^G tJUg^ 

CTTTCTTTCCTTGGTTTCTTACCATACTCC AATTTf, ATATGATTGCTTTf: Majority 



x A c; 



5900 
— i— 



TC 2603_ai2.seq 
EBI nem3l6_ai2.seq 



5774 
5874 



A A « 



T C T T T 



5910 



a f r 



5920 



TGGTTTGTTAC C A T a| 



5930 

JL 



rt;. a 



5940 

JL 



C T T T C T TllCGTTC gB-TtBt T A C C AT ACT C C A A T T T 



C A A T T T G AT A T C A T T G gQ T T C 2603_ai2.seq 



5950 
■ 



ATCATTGGTTTC nem3l6_al2.seq 



5824 
5924 



TGCATTTTCAATACTTGITTGCCCCTTC A CCGTAGCTCTGTAACTCATTT Majority 

5990 6000 



5960 



t g g attBt c a a t aQt t gH'tt gBcc 



T G G A 



T T C A AHA G TH G T 



6010 
a. 



T A ACT! 



2603_aI2.seq 



TTflACBCTAGCL 

GCCGTTCACCGTAG cTgHgt A A G T G a"tH7 ne^ielal^.seq 

TGATTTCAATAtCTTTTTTTTTTCCTCC T TtTTCTACTTTTCCCAGTCCT Majority 

6050 

ED CCABjr C 2603_ai2.seq 
T G CQa gBc C T neta316_al2.seq 



5874 TB AQt T C A ATATC^ 
5974 TGATTTCA AH A T C T T 




6020 6030 6040 
' — : 1 MB M| 

BaBtttB 



T T T T G CH G C 



A 11 t 



CTACT 



5924 
6024 



TTTGCTTT CAA.TTTTA GTCTCAAACCTTGCTCATCTGTTCTCA C T T T G T A Majority 

. 6060 ~* ' 1 

tJGH g ctBt g 

D T T G cIJt TH 



6070 
Jj 



6080 



A A TTTT AflTG T G A A A C c| 



i; (i r a 



a r t; 



6090 

C A i cBB TH C T|h A G T T 



6100 
l_ 



GAAACCTTGGTCATCTGTT 



ETA 2603.jai2.seq 
GAGTTTGTA nen316_ai2.seq 




5974 G.t| 
6074 G T T T| 



| A G G A A A A 



TO 

T T 



T G T A| 

A T| 



6140 

GCTTTAAG Gfl T| 



J3 2603_al2.seq 
T G T TflC t T T A AflG T T G T ne*316_ai2.seq 



6024 
6121 



.TACTTGAACCCATTGTCAAACCTTTT G TCATTCTATCTGTCCAAACCACT Maior^v 

6200 



6160 



6170 



6180 

JL 



6190 



G T T C a A r ^VM V ! ^ A A A C CDT T T G0C A TtBt A T C T G KCAAACC Alft 2603_ai2.seq 
G T T G A A C^^^G TCA AACCTTT||GTCA TBfc T ATCHGTCCAAAC CAGT nemSlO.alCeq 



TTTTTCTAT TTTGAG CC TTTGTGA A.TTTTTGTTTTAACTTTGTATTGTTT Majority 

6250 



6074 T T 
6165 T t| 



6210 



tBg aH 

Btgag 



6220 



6230 



6240 

JL 



CCTT TGT GAATTTTT G.T TTT.A A C T T 



T Al 



2603_al2.seq 



CCTT TKJG A ATTTT TGT iGD A A C tBtGT A TtDTTT ne^a^.seq 



TTTTTTACCGATTTTCGCTGTGGCAGTT GCTTTGTCACGTGCATAATTAC Majority 



6124 
6215 



6260 
* 

SSBt ttaccgatttI 

T T fH TH A C C 



6270 



tH t c g c t g R gBc a 



6280 



6290 
x. 



6300 

JL 



G COG TBG C A G TTG CTTTCTC ACC f G G ATA ATT AC 2603^12^ 



A r A (' 



CTTTCTC 



nem316_ai2_s'eq 



CATAA TTTCCGCC ACCTCT . CAA A AG TCTATTTTCGTTTGT CA GTGTTCTC Majority 



6310 



6174 

6251 



.6320 
x 



6330 

Mm 



C AT A A. T.T X.G GGG.CAGCT.fijT:GAA\A.A^ f.C T ;A T f 





6340 

JL 



6350 
JL 



JB T G T C ABIT GH-T G T C 2603 aiZ.seq 



T T?TT C g H ? T G T C A & TlPlBH T C n^l6_^seq 



A A G TTG TTTGTTT T T G,C A A A G T.T T T T A T C T A TTTC TGG T.T T T T- T^T'T.C.AG.T Matbrti^ 

6380 6390 . 6400 

j. 



6360 



6^70 
J 



t£ t I I £^ I^P. G C ^ A A G T T T T T A T cA aJT T tH t G G T T T T t|B T T C A G T 2603 ai2.seo 

6272 A A cBItOt T tBItBBBFg C A A A G T t||T T A T C TAflt TGTGGT T T t||? T T C A G T ne^ILal^eq 

.G T T C T t T C G A T A A A T G.T G G G C, A T C T T T A A C A A C A C C T t G f T T G.T T T A C C A Majority - 



6410 



6420 



6274 GTTCTTTGGAT A A aHH TGGGCATC 

"6320 ~ 



* 6430 



6440 



a c; (' 



A AC A A C A C C 



6450 
i— 



T T C*T TTG.G ATA A A~T~G t gH G C A.T €"1^77 A AC A AC AC C^T^gIHt^^ TTTAC CA ne^lCi^eq 

..A TCCCAC ACTGATT TTAACTGCAACTCCTTTTC A. A TCACCCAGGA TG G A A Majority 

6460 6470 6480 - 6490 



6370 atggcagagtgattttaactggaactgcttttgaatcagcca 



G G A|Jg 

H g a tH 



6500 
t_ 



™l* ^^?^*5I5^I , ?II A AC l GG A ACh'gCTTTTG A ADC AG CC AjGG Akfl GGAA 2603_.ai2.seq 

G A A nem316_ai2.seq 
^Wctt^I tW gl&g Eg gPSFcTc ttc AACCATTTTGTA ^t-t^ t^:

_ „ _H __, _ f €540 

6374 C CUT tBttGTTGT AH tH G ATT X iBB T flFlI~~"~~ ~ ' 



6420 CCGTTGTT 



6530 G540 6550 

C T A~G~t" nCATTT tT7 T t tT a«3 I ^ A A C B ATTT 3n ACT* 2603L*12.seq 
^ l AGTTTGATTTTTCTT1 C AJJ TTCAACGATT t||gT a CTR t ne_316_.ai2.seq 

TCCTTTTAATTTTTTGGTGTTGAAAGCA A GTCCACTGTCTTTTTTTGGTG Majority 

_ „ 65 ^ |m| 65 . 7Q €580 6590 6600 

6424 B C C T T 70 A A J^PJL^LS, GTGTTGAA aP' c A A C T C C A C tI|t C tM_M T~ H T C G T C 2fim . 
6470 TCCTTTTA AOpTTjSEEH G T G T T_H A A AG C A AHH C C A G TG T C T I^T^D^ T^HHHBI ne^7e_,a^2^seq 



6474 
6520 



TTGATCCAGGCACGCCCTCATCTTTATTT TCTTTTGT.TTCCGGAGTATCG Majority 

6610 € f^ 66 30 6640 6650 

iAJ^TTjjic THTjJ^T T T CH_GC A C T aQ C^f 



T|H1 A T C C A Glre A C G G C CT GATC 
T G A MSB AHgH AC0G C EH C AN CTT 



(i {' A 



T C T T T 




TTCC 



2603_ai2.seq 
A G T A T C G nem316_ai2.seq 




6524 T C T T 



6567 



CTT 



^ 6670 6680 6690 

«ff 1 T k ^LZAA^_C_TG G TGTTAATTT GTTACCTT CTTTTEcCTTAAlh 2603_ai2.seq 
" 1 ' ^^^^^^^^^^^^^^^^^^S^^^mmacJglT; TTT c gg T A A T nem316._ai2.seq 



6574 
6590 



GTATTGCATTTTACCACTTTTATTTTTTTTCA A AGCTAAAGCAAAGAACG Majority 

Bt 
gB 




67i0 
1 



67 . 20 6730 6740 6750 

A T TJ GC_A_HT T T A C C aBtTTT A Tfl T t B T T T C A A A G C T A A A C C A A A C A A r k 2 603_ai2.se q 



ATT 



ttIStaccagtt tH a t t t tt\ 



A AAfiC AAAGA AH G nem3I6_.al2.seq 




6624 C A C C 
6628 C A C C 



T tOg ATTTCTTT 
D TT0A T T T C T T T 




G C C A A AC T A ADjCJOuU A A CO T C A 2603_al2.seq 
T T G AG C C A A aQt A A GQ T T T A AGGTCA nem316_ai2.seq 



6671 




6678 TTAATT. T G T T T A C C 



A C C T T TG TAG T C T T T TH C G T T 



A A V A 



T AK_ T C 




a a t; 



A CC T 



G T T C C T&G 2603_ai2.seq 



TCBTTCTTA CC T T T T G THC C tTH he_.316_ai2.seq 



6721 
6728. 



GO AGTTACTTTTG TTAAGT TTTCCTTGTGTTTTG AC A A.TCTTGTCC A AGG Majority 

6880 6890 6900 

AATCTTGTGCA Afl G 2G03_ai2_seq 



6860 

JL 



6870 

JL 



G «^^^T A C TT T T G T T A A gJE IT _T_T G_jT T iMfl G T T T__B| A 1 

G ^™ T A HBET THT0[A AllitaBBG CTT G T GH__)TGA CAATCTTGTGCA aIg ne*3T6_al2.seq 



TCACTGTATTAGTTGTTGCTTCCTCCGCAA A CGCTGGTGCAACTGAGAGT Majority 

6950 



6910 



6920 



6930 

JL 



6940 

JL 



1 r IrMIr I * ^ I T T cBlCCC C A A ACC CTC CT CC A ACIC A C A g| _ 2603 al 2 sen 

6775 T C A CBC T A T_[ A G T T G T TBc_T CC T'CCGCAAACGCTGGTCCAACT'GAG aRU ne^laif^eq 

AGTGACGTTAAGGTCAGTAGCAGTGTC G AGAACATTGTAA-GA T.A TTTCTT Majority 
• 69 . 60 _ 6970 6980 • 6990 ' 7000 

I&25 A^TGA C C T I A I r- i„ I W r A ? T ? B C G A *• A ACATTgDIaaBaTA I T T. G XT .2603^12.^ 

682S A G T G A C G T-.T A A Gfix C A&T A GC A C T G T.C G A G A A C A'Q T G t A A 6 AB A T T T>T T te^316lal2^q 

G.A T T. T T T T T C AT T T C T A T C . T CCT A: T T A T T T T A^G T T A A T C A A CA TG GTT A '1iattirAi-v »; 



687 i 



7010 
-i 



7020 



7030 



7040 
J- 



6875 fiUtilTTtCATTrCTATCTCCTtCIlifliH„;„^«;^ 



7050 



CAT t? T T T t r ^ J t t n t ^ i ? ^ ^ £ 3 J H Z I ^ 3 J ■£ A AACA T OHfe T A 2603^12.^ 




^ > 1^ I A I-Se G G A TT JL TAA T aWA:C CGC A.GC ACCACTC cB T T.C A AG TCaTg" 2603^12,sea 
6925 A T A AT A THC GC A T TflT A A T A T T A C C G C AG CA.CCACTC C™T T (JBOOBBBEI ne^^aiOeq 

CAATTTTAT TTAATT AATTAAGAATACTAAAGCGCATGATTTTTAATCTT Majority 



7110 



6970 



7120 



7130 



7140 

JL 



7150 

JL 



? * T T A A T T'A ATTAAGAAT AC T A A AC C G C A T_[ ATTTTTAATCTT 2603 al2 sea 

6967 GAATTBtATTTAATTAATTAAGAATACTAA ABcBc A TUT TflT T A A T C T T n^Te^i^eq 
TTTtT



fe'tk.ci'A TiMm UCT'T'A W^4^i e-Vfl 'x TAT 

I - T 1 — 



7020 

7017 T T TWT C T 



CTTTTCCAAATATAAATT Majority 
7160 7170 7180 7190 7200 



7117 



7170 
7167 



7220 
7217 



T T tIB^Th? J ^ 5 ACTACATTTCTTAT'ABCTTTTCCAAATATAAATi 2G03.al2.seq 

T T TOT C TflC ATATATCACTAGATTTCTTATATCTTTTCCAAATATAAATT nem376_ai2^eq 

- CCACCT GCAATACACATCATAGCTC C ACCTATTAAAATGAAAGATAGAAT Majority 
„ 72 . 20 7230 7240 7250 

70S rCArrTrr^^T^^ A ^ ATCA TCCACC TATTAAAATGAAAGATAGAAT 2603 al2.seq 

7067 CCACCTGCAATAGACATCATA g|!t CCACCTATTAAAATGAAAGATAGAAT nen^alZ^eq 

TCCTTTCCCACCTGTCATC GGAATAATTCCTTTTGGTGGAATATGCGTGT Majority 
rcGO 7270 7280 7290 7300 

7120 TcflTrTr^^ 2603_al2.seo. 

TCCTTTCCCACCTCTCATCGGAATAATTCCTTTTCGTGGAATATGCGTGT nem3l6_ai2tseq 

TGGTAATTAAATGCTTGTC ACCTTCCTCATGATATTCAGAAATCTCTTTA Majority 
73U) 7320 7330 7340 7350 

Jrr^MJI.^J^I^ I' C ACCTTCCTCATGATATTCAGAAATCTG.TTTA 2603_al2.seq 
TGGTAATTAAATGO.TTGTCACCTTCCTCATGATATTCAGAAATCTGTTTA nemSIG.alHeq 

TTAACAGCTA TTATAT TTTTTATCGATC CTTTAACC ACTTC A A A AGTTA A Majority 
J_5^ 7370 7380 7390 7400 

Z Z * f ~ A S IP A TT A T A T T T T i T ATCG ATCCTTTAACCACTTCAAAAGTTAA 2603 ai2.seq 
TTAAC AGCTATTATATTTtBtatCG A T C C T T T A A cB A C T T C A A A ACT T A A ne^ai^L, 

_AATTGGTTTA.T-TA.CT AATT TTTTG ATA ATCCTTCGGCGAAA CTG CTTCT A Majority 
__ Jj*^ 7420 7430 " 7440 7450 

^II^^I^^^^^^ntCAtAATCCTBcGGCGAlACTGCTTCTla^ al2.seq 
AATTGGTTTATTAGTAATTTTTTGATAATCCTTCGGCGAAACTGCTTCTA nem37 6 _al2.seq 

TTAACTGATATTTGCCATC TTTCAAATCTTTGTAAGAAATTTTGC.CGTTT Majority 
• 7460 7470 7480 7490 - 7500 

??A A STIFFS ATCTTTCAAATCTTTCTAAGAAATTTTGCCCTtT 2603_al2.seq 

TTAACTGAT AT TTGCCATCTTTCAAATCTTTGTAAGAAATTTTGG.CGTTT nem316_ai2.seq 

TCTCCCGTCACTACTTTTGAA TTA. TTATTTTTTATTGGTAAATAAAGTTT - Majority 
7510 7520 7530 7540 7550 

3^^L°IS ACT - CTtTTCAATTATTATT ^ TtTATTGCTA AATAAAGTTi 2603 ai2.seq 
TCTCCCGTCACTACTTTTGAATTATTATTTTTTATTGGTAAATAAAGTTT ne^Te^a i^eq 

ATAA TCTTCATTAAATTCTTCAACTTCAAACCTAGCTCCTTTCACAACCA Majority 
7560 7570 7580 7590 7600 

ZlSZ tZ^ZnZZ^ZZ^ A ^ TTCTTGAkGTTCAAACGTAGCTCCTTTGAG ^ AG ^^ 2603_ai2.seq 
7417 ATAATCTTCATTAAATTC TTGAAGTTC A AA CGT A'G CTCCTTTGAGAAGC'A nem3l6_al2.seq 

ACTTATTATTATCTTTATC AA CT TTTG T AA ATTCAATTTCACCTAACTTC Majority 

76 , i0 , " • 7620 ■ • 7630 ' 7640 7650 

A £ 1Z A Z Vt T ^ A Z C T T T A T ^ A A C." T- T T TH T A A AtTCAATTTC'A C € T AA GTj'jC- 2603 a!2,saq 
7467 ACT T A T T, A T T A T C T T T A T C A A C T T T f 'G t A A A T t C A A T T T: „C ; A C C T A A C f TC .k^^ail^eq 



7270 
7267 



7320 
7317 



7370 
7367 



7520 
7517 



.7570 
7567 



T T C T C C T T„T TTAA.TCGTT AT TG T A G C A T ATT dlC T C AC AT 6 A C G AATT TT Matoritv 
. , . . ■ 7660 • 7670 ■ . . ' 7680 ' . 7690 . [ 7700 

Z ZZ Z S S H t * T A -A T C G T -T AT T G T A G G ATATTCTCTCACATCACGA AT TTT .26oV*i2-seq 
TTCTCGTTTTTAATCCTTATTGTAGGA T A TTCTCTC AC ATCA CCA ATTTT nei.i31«_ali!1eq 

A G G C A T T G G A A A A T C TC T A A G T C T A T T A G G'A T C C T C T G A T T T A G C A T T C.A' Majority 

T710 7720 7730 . 7740 7750 

-* ■ : 



A P G G A T T G € A A A A T C T C T A A G T G T A T T A G G A T C C T CTG aQ T T A G GAT T C A 2603312 iseo. 
AGGGAJTGGAAAATCTCTAAGTGTATTAGGATCCTCTGATTTAGGATTCA ne«316_al£seq 

ATGTTGTTCTACCATTAGTGTCATAGAATTTCTTACTTATAAAACTCTCA Majority
7760 7770 7780 7790 7800
7620 ATGTTGTTCTACCATTAGTGTCATAGAATTTGTTACTTATAAAACTGTCA 2603_ai2.seq
7617 ATGTTGTTCTACCATTAGTGTCATAGAATTTGTTACTTATAAAACTGTCA nem316_ai2.seq
7670
7667 



tcu p ctP iisc agg gugm cTtTt t 6T u ? T c T c c T A at?t ^; — 

7810 J820 7830 7840 7850 

TrTArT^Tr^r^T^^^^^^^^^^^^^^^^^^^^"'* ^ T C T C C T A A0T T 2603 a!2.seq 
TCTAGTTTCACATCATATGTGACTGTTACTTTTTGTC cflT cfl C C T A AG T T neaCjTe^il^Lq 

CAAACCTCTAACGTAGAGTTTATTTTTGATGTATTCTAATTTAACCCCTT Majority
7860 7870 7880 7890 7900
771? C A A A c C I C Va a Wl A ? A S333 ATT ^ ATGTAT TCTAATTTAA C C C C ||t 2603_ai2.seq
7717 CAAACCTCTAACGTAGAGTTTATTTTTGATGTATTCTAATTTAACCCCTT nem316_ai2.seq

TAACTATTCCACC ATCAT TATTAGC'CCCACCACTTC CAATGCTATCTTTC Majority 
' m 7920 7930 7940 7950 

?M?^TJr^^^^^ T ^ TTAGCCeCA ' CACTTCCAA tBgDa T cBt T C 2603.aI2.seq 
TAAGTATTCCACCATCATTATTAGGCCCAC CAGTTGCAATGC TATCtTTC nem316_ai2!seq 

ATTATACTTCCATCATTTCC -CTGTAAAGTATAAT. CACTTGGTTGTAATGT Majority 

7980 7970 7980 7990 8000 



7770 
7767 



7820 
7817 



A I I A 9 A r I J r r 1*3 S Til ! ^ C T G T A A A GT A T A AT C A C TTG cfl TGT AATGT 2603_ai2.seq 
ATTATACTTCCATCATTTCCCTGTAAAGTATAATCACTTCGT.TCHAATGT ne^I^aizleq 



TTCTCCCTTGCCAACCTCTAA ATTGATTTTGTCACCCATAGGATCTTCTA Majority 

8020 803Q 8040 ■ 8050 

7870 T T ^ T ° *??3 3~S S * * 5 £1 5 A A I I ? A I I TlE( T C A C C C A T ^ G G A T C T T cQ A 2603„al2.seq 



7867 



TTGTCCGTTGCC A AG CTGTAAATTGATTTT G T.C A C C C A T A GG ATCTT A nemsle.a^.seq 
T A C T T C C A T T A A C A A T T.G A G T T T T C T T T T G T T A A A A T C T T T T C A A A T T C T Majority 
SO, 60 6070 8080 8090 8100 

7920 1 TAGTTCCATTAAC. AATTGAGTTTTCTTTT.GTTA.AAA T cQt TTCAAATTGT 2603 a!2 see 
7917 TAGTTCC A T T A A C A AT TGAGTTTTCTTTTGTT AAA aQ CT'T TTCAAATTGT nSSeo 

T G C T G A A T T T T A C A T A A A A T T T C A T T G T T A G A TGTATCG GCT GAAG TT AC Majority 

J 8 V 10 Bt20, 8130 • .. 8r40 " .8150 

7970 TCCT G A A-Tl T T A G A T A A A A T T T C A T T (i T T.A G A TG T AT C G G' G T G A A G T T A C 2603 al2 sea 
7967 TGCTGAATTT.TAGATAAAATTTCATTGTTAG"ATgBaTCG.gHtGA AGTTAC le^lSiSea 

TATCGGGGTGTACTACTCAGGTTT G GAA&AG A AT GACTTCATTAGTTCTC Majority 

8160 8170 8180 - 8190 8200 

' t . _ " # / * 



8020 E A tBgG G G TGT AG T A CTCAGGTTTGGAA GAG A aQ G AcBtC ATTAGTTCTG 2603 ai2 see 
8017 TATCGGGGtBtaBtACTCAGGTTTBgAAGAGA aTgACTTC ATTAGTTCTG ^I^ail^eq 

TTATTTCTCCATCTGAAAGTT TAAAAGCTTCCTCTTTCAATTTTTCAAAA Majority 

f210 . 8220 8230 8240 8250 

8070 TfeX T TT C T C C A T C T G A A A G T T T A A A A G C.T T C C TC TT TC A A T TTTT G A A A A 2603 ai2 seo 
8067 TTATTTCBCC ATCTGAAAGTtBaAAAGCTTCCTCTTTC AATTTTTG A AA^ l^W^lTl^ 

GTACCAT CTT GA. TTTT.TCTT A T»A CTCCTC A TT A T A A ACTTGTCTAAAAGC Majority 

826 ,° 8270 8280 8290 - 8300 

8120 GTACCATCTTGA TT TT TC TT At ACT C C T C A T T A T A A A CTT GT C T A A A A 



~ — « - - - v - , v, i * i - a & i i V * * * i a> l u |U I U A f TATA A ACT.-TGT C T A A A A IB C 2603 a!2 sea 

8117 G T A C CAT C T T .G A T T TT T C' T T A * A«T C C'T CA T * A T A A A C T T G T C T A A A ^^Sll^eq 

A gjjJE A %4 T € T AT AC C A A A . AT T A AACATGT C-'A.T .A ATT T TT t T C T T f T A" A.A C Majority 

gj* 0 8320 . gyO . 8340 8350 * 

? 170 A G A T A f A T CH A T .-A CCAAAATTA A AQ A T GTC AT A A -T TTTTCTGTT TT AAA C 2603 al2 sea 
8167 A G A T A T A T C T A T A C C A A A AT T : A A AG ATGTC ATA ATT TT.TCTGTT'TTA A A C ne^T6lai2^ea; 

TATTTAT ATA AACTTTCCT TCGTGTTCC ATC T T.C TT. TTACTGGTCCATTT Matoritv 
' . . ' 8360 8370 8380 8390 ' 840o" . 

5 220 TATTTATATAAAGTTTGGT TC.G TGTTC CATGTTCTT tQ A CTGGTCCA'T TT 2603 aia seo 
8217 T A T T TAT ATA A AG T T T GG T-T GG TGT TCCATGT TCT||TT ACTGGTCCATTT nemsTe.a^Tseq 

CCATAAATTGTACCTTTAGG GTAATTAACA TT TAA A TCT'A A AT A ATC AA G Majority 
: 84 /° 8 y0 . 8430 . 8440 8450 

«270 CCATAAATTGTAGCT.TTAGGGTAATTAAGATTTAAATCTAAATAATGAAG 2603 ai2.seq 
8267 CGATAAATTGTACCTTTAGGGTAATTAAGATTTAAATCTAAATAATGAAG nem3Te_al2 seq 
TTTTTGTAAGTTTCCAGAGATTATCTGTGTTTGATAACTATCTAAGGGAA Majority
8460 8470 8480 8490 8500
8320 TTTTTGTAAGTTTCCAGAGATTATCTGTGTTTGATAACTATCTAAGGGAA 2603_ai2.seq
8317 TTTTTGTAAGTTTCCAGAGATTATCTGTGTTTGATAACTATCTAAGGGAA nem316_ai2.seq

ACAAAAACTAACTCTCCCCATTTCCTTTTATATCCTCCCCCTTATCAGTA Majority
8510 8520 8530 8540 8550
8370 ACAAAAAGTAACTCTCCCCATTTCCTTTTATATCCTCGGGCTTATCAGTA 2603_ai2.seq
8367 ACAAAAAGTAACTCTCCCCATTTCCTTTTATATCCTCCGGCTTATCAGTA nem316_ai2.seq

AGTAGAAAATTACTTTTATTTAGATATCCATTTTTTTTCATTTGTTCAAA Majority
8560 8570 8580 8590 8600
8420 AGTAGAAAATTACTTTTATTTAGATATCCATTTTTTTTCATTTGTTCAAA 2603_ai2.seq
8417 AGTAGAAAATTACTTTTATTTAGATATCCATTTTTTTTCATTTGTTCAAA nem316_ai2.seq

TTGGCTTTCATATGATGCACCCAGTTTAAAATTATTAATACCATATGATC Majority
8610 8620 8630 8640 8650
8470 TTGGCTTTCATATGATGCACCCACTTTAAAATTATTAATAGCATATGATC 2603_ai2.seq
8467 T T G G C T T T C A T A T G. A T G C A C C C ABT T T A A A A T T A T T A A T A G C A T A T G A T C nem316_ai2.seq

TTGTTGGAACACCATCAGTTATATGAACAATAATTTTTTGACTATTTCCA Majority
8660 8670 8680 8690 8700
8520 TflC TllG G A A C A C C A T C A G T T AD A T G A A C A A T A A T T T T X T G A C T A T T T C G A 2603_ai2.seq
8517 TTGTTGGAACACCATCAGTTATATGAACAATAATTTTTTGACTATTTCGA nem316_ai2.seq

TTTACTTGACTCAAAATATCATCTGCCTCCATGAAGGCTTTCATAGTAAA Majority
8710 8720 8730 8740 8750
8570 TTTACTTGACTCAAAATATCATCTGCCTCCATGAAGGCTTTCATAGTAAA 2603_ai2.seq
8567 TOT A C T TG A C T C A A A A T A T C A T C T G C C T C C A t G A A G G C T.T T C A T A G T A A A nem316_ai2.seq

TGTTTCTCCTACTTTACTAACATAGTACTCCTTTTGTTGCTCTGCAGTTA Majority
8760 8770 8780 8790 8800
8620 T G T T T C T C C T A C T T T A C T A A G A T A G T A C T& CTTTTGTTGCTCTGGAGTTA 2603_ai2.seq
8617 TGTTTCHCCTACTTTACTAA GAT AGTACTGCTTTTGTTGCTCTGGAG tQa nem316_ai2.seq

GTCCGTTTGTAGTTGATCCCCATTTAGCTTTACGAGCTTCTGTCGGAATC Majority
8810 8820 8830 8840 8850
8670 BtCCBttHgTA G T.H G A T C C C C AHt T A G C T T T A C C A G C T T C T G T C GG A A T C 2603_ai2.seq
8667 GTCCGTTTGTAGTTGATCCCCATTTAG C.tHt AGGAGCTTCTG tQg G A A T C nem316_ai2.seq

CTTTTTATAATCTCTTCAGCATTATTTGTTAATTGTTTATGACTATAATT Majority
8860 8870 8880 8890 8900
8720 CTTTTTATAATCTCTTCAGCATTATTTGTTAATTGTTTATGACTATAATT 2603_ai2.seq
8717 CTTTTTATAATCTCTTCAGCATTATTTGTTAATTGTTTATGACTATAATT nem316_ai2.seq

* » 

CTCTCTCTGAATTGTGAACTTAGTTTGAAGGCCATAATATTTATCATCTT Majority
8910 8920 8930 8940 8950
8770 CT C T G T C T G A A T T G T G A ACTTAGTTTGAAG G.CG A.t A A T A T T T A T 0 A T C T T 2603_ai2.seq
8767 CTCTGTCTGAATTGTGAACTTAGTTTGAAGGCCATAATATTTATCATGTT nem316_ai2.seq

XTTT AAATCCTTTTA CG A C A T C T A C A C T CCTAC C Jl T C _A A X A A T AT C TG A A Majority 
8960 8970 8980 8990 9000

8820 C TT T A A A T C G T TT T A CG A C A T C TA C A G T CC.TACC A TC A. A A A A T*A T C T G A A " 2603_jai 2. seq 
8817 C T T T A A A TC CTTTTACGACATCTACACTCCTACCATC.AAAAAT A.T CTGAA nem316_ai2.seq 


. . CG A T A C G T A A C T A A T G C A A C C C T A T T A T C A C T G T T T G C T C C T A A A A T A.TC Majority 
9010 9020 9030 9040 9050

8870 CG ATAGCTAACTAATGCAAC C C TATTATCACTGTTTGCTCCTAAAATATC 2603_.ai2.seq 
8867 CCA T.AGGTAACTAATGCAACC.C TATTATCACTGTTTGCTCCTAAAATATC ne*316___I2;seq 

. TTTTACTGCGGTCCCAAGAGCTTCGGCAGCTTTCTTGGCTTTATTATCCC Majority 

9060 9070 9080 9090 9100

8920 TTTTACTGCGGTCCCAAGAGCTTCGGCAGCTTTCTTG.GCTTTATTATGCC 2603_ai2.seq 
8917 TTTT.ACTGCGGTCCCAAGAGCTTCGGCAGCTTTCTTGGCTTTATTATGC. C nem316_al2.seq 

TGACTTAGAATTATCGACTACG Majority 



T T T G A 



9110 9120 9130 9140 9150

8970 CGC CATCGTTATTC ATTGAGTTAGAATT. ATCGAGTACG 2603_ai2.seq
8967 TTTGAAAATTTGGGCCATCGTTATTCATTGAGTTAGAATTATCGAGTACG nem316_ai2.seq

AAGACAACATCTAACGGCTTTTGTT T GTCCACTGCTTTTACTATGGTTTT Majority
9160 9170 9180 9190 9200

9020 ^f^M^^ C JM^^ TTTTCTTTCTCCACTGGTTTTACTATGCTTTT 2603_ai2.seq
9017 AAGACAACATCTAACGGCTTTTGTTTGTCCACTGCTTTTACTATGCTTTT nem316_ai2.seq

TCCACTCACA GTTAA CTCAATTTTATATTTATTATGACCTAAATCACCTA Majority
9210 9220 9230 9240 9250

9070 IcCA rl r A r a r II T ° AATTTTAT ATTT ATTATG ACCT A A ATCACCT A 2603_ai2.seq
9067 TCCACTGACAGTTAACTCAATTTTATATTTATTATGAGCTAAATCACCTA nem316_ai2.seq

CTTCTCAAA TACGTT TACATAATGTTCCCTCTCGAATTTCTCTTATATGC Majority
9260 9270 9280 9290 9300

9120 iH^f^niT^ 2603_ai2.seq
9117 CTTCTGAAATACGTTTAGATAATGTTCCCTCTGGAATTTCTCTTATATGC nem316_ai2.seq

TCACCTT CACTTG AATATGGGTTAACTGCTTTTGCCTCTGACTTTCCATT Majority
9310 9320 9330 9340 9350

9170
9167 TCACCTTCACTTGAATATGGGTTAACTGCTTTTGCCTCTGACTTTCCATT nem316_ai2.seq
TGGAACTGAA CCTTTAACA TGCTCA A GTTt AT AACATTGCTTTCTATCTT Malorirv . 
9360 9370 9380 9390 9400

921? TGC AAvlr A Ar rllll^ ^ G C ^ A A G T T.T AT A A G AT T C C T T TG T A TC T 1 2603_.ai2.seq 
9217 TGGAACTGAACCTTTAACATGCTCAAGTTTATAAGATTCCTTTGTATCTT ne«316_aiz!seq 

C A T A A A T TGC T GT.C GG G G G AT A CT G C TT A TCT A C TT C TTC GTG A TTTTC T Majority 

9410 9420 9430 9440 9450

£S c aVaa il I r rl 5 J' 5 S S T A C T ° C T T A TCTA'GTTCTT cRt GATTTTGT 26d3_.ai2.seq 

9267 CATAAATTCCTCTGGGGGGATACTGCITATCtAGTTCTTCGTGATTTTGT ne_3i6_ai2^eq 

CCAATTGTGGAATTTTT'ATCAiCCAC TATTTTCTATCGTAGTTTTTC CAT T Majority 
" 94 . 6Q 947Q 9480 9490 9500 : 

S?2 C C A A TTGTGGAATTTTTATCA.CCACTATTTTGTATCGTAGTTTTTCCATT 2603 ai2 sea 
.9317 CCAATTGTGG A A T T T T T A T C.A C C A C T A t T T T G T A T C G T A G T T T T T C C A T T neSl^ai^eq 

ACTCTCAACCTTAACTTGC CAAGTCTGGTTAGTCTTTTTATAACCTTCCG Majority 
9 j>^ 952Q 9530 9540 9550 

A r£H CAA £ CTTAACTTG 00 AACTCT GGTTAGTCTTTTTATAACCTTCGG 2603 ai2.seq 
ACTCTCAACCTTAACTTGCCAAGTCTG. GTTAGTCTTT.TTATAACCTTCGG nei*316„al2?seq 

CCGCTGTTTCTT CTGATA .AAGTATAATCTCCACCTATCAGATTATCAAAA' Majority 
95 J 6Q , 9570 9580 >■ . 9590 9600 

%™ " ? S G G I G 1 1 T I T 3 C T C A * A A A « T A T A A T C T C C A-G .G T A .T G A G A T T A T C A A A A 2603_ J ai2,seq. 
,9417 GCGCTCTTTCTTCTG ATA/A- A ,0 TATA AJCTd C/A. GG TATGAGATTAT-C.A AAA' ^ean3l6^ai_^< 

. 6 T A G C T T C A: C C TCT TAG C T € A C, C A G T T A CT T TT T CTAT TT T A .C T.TTGTCC ^forlfv 

Q? 10 9620 . 9630. • 9640 . 9650 

M*> C T A G C T T C A C C T G T T A GCTCAGC AG T.T ACTT t T T CTATTT TAG T T TCTCG 2603_ai2.seq 
9467 GTAGCTTCA CG TGT.T AGC TC'A G C'A CTTACTtTTTC t AT TT TACTTTCTGG neaaie.aiZ^eq 

ATGAGCAGTA GTTT T T A A A A C!A A A GOT AG C-TTTTC A A AG T G C I TTCTTCT Majority 
96 . 60 ■ * • * ^670 • 9680 9690" 9700 

octT t IS A So A S"I' A G T T T T T A A A A C. A A A G G T A G C T T T T G A A A G TGGTT TG T T C T 2603 ai2:seq 
9517 ATGACC AGTAGTTTTTAA A A C A A A G G T A G C T T T T G A A AGTGG TTTGTTCT nenalejail^eq 

. GCTCATCTGTCTTTTTAAC AACTAACTTTCCTTTAGCACCATTTTCCGGT Maloritv 

9720 9730 9740 9750 

""J CCTC AT CT GTCTTTTTAACA ACTA ACTTTCCTTTAGC ACC A T T T T C C C G T 2603_al2.seo 
9567 GGTCATCTGTCTTTTTAACAACTAACTTTCCTTTAGCACCATTTTCCGCT a^.ailtseq 



9370 
9367 



seq 
9670



nr AAA AAGACTT AACGTCAAT ATTTTAG A. A AATTTTTGGTATTTTCTCA TTT 
9667 AAAAACACTTAACGTCAATATTTTACAAAATTTTTCCTATTTTnu^TaS 
TACAACTCCTATTGTGCCGAAATGTCGTTTC T AAATCTAAGATCAGATAf! Majority 



seq 



.seq 



9720 



9860 9870 9880 9890 9900



TACAACTCCTA T.TGTGCCGA-AATGTCGTTTCTAAATP TA A P A t p a pa t a r» _ 
9717 I A C A A P T P P T A"TTr , f-/' r» ^» r> a « .^~*«,!* . i ' 1AA( * ATC AGATAC 2603_ai2.seq 

.4ACAACTCCTATTGTGCCGAAATGTCGTTTCTAAATCTAACATCACATAC nem316_ai2 



.seq 



9770 
9767 



AGAATATCCTAGAATATACAAACTATCACTT ATTATCATATCAATAATTT Majority 

9910 9920 5930 9940 g^SO 



A C A A T ATCCT AGAATATACAAACTATCACTTAtTATGA TAtCAATAATTT , ft(M lo 
AGAATA-TCCTASAATATACAAACTATCACTTATTATGATATCA AT A A T T T ne!«3)icLal2?seq 



9820 
9817 




CTT ATT AT A AGGTATrPAATTTTA iM ^ TT TCCCAATTTTTGAA A T 2603_ai2.seq 

CTTATTATAAGGTATGGAATTTTAaBgTTTTTTCCCAATTTTTGAATGAT ne«316_aiCeq 




9870 TTTTCT TTTT ATTTC AT A A TC TT A TTTTTT ATTA TC TT A r A A A t a t Vt r 1 " ,<> 
9867 TTTTCTTTTTATTTGATAATCTTATTTTT^^ 

ATTAGCTT.AAGTAGTTCATTTTTCTT T T TTTATGTTTTA A A ATATTC CTT Majority 

• 10060 ' 10 ?70 ' 10080 I00 90 , toioo 

9920 A TflA GCTT AAGTAGTTGATTTTTCTTTTTTT A'T GTTTTA A A AT ATTf PT 1' ^ ,o 

^ attaccttaagtag^ 

aaaaataatgtttgaga g a 

10110 H 

• 9970 AAAAATAATGTTTGAGAG A| B G T T T A C T C A ATTG ATTGAAAATTATTTA P ran* 
9967 AAAAATAATGTTTGAGAG AC AGTTTACTGAAT?GA??CAAA^ 





l^ lS AAAAAC ACATCCTTAATCAAATAAAA.CtTCTAACTT TATGCTATCATTA r rarv* ..c ^ 
10017 AAA A A G A C ATCCTTAATCAAATAAAACTTC T A ACTTTATGCTATGAT T" A C n^ai^eq 




T A C C C T T C C A t T A C T C T A G A C A A A T C A T G T' C "1 : T C A A C t T G G T T T A T C t G A 26m <^ 
10*67 T A CCUT C C ATT A C TC T A G AC A A AT C A T G T-C A T C A Ap TT GCT.f ?}?££S i'!S^S£l q 

.ACTA C.T T A.T. T. A PC A A AT ATKTCATG A I T IAAC AACTT T A T f T A ic A TS T G Ifa ion ^ 

10260 10270 10280 10290 10300



10118 A G T A.C.TT A T T A G G A A A T A t T G T C A T G A T T T A A C A A CT T TAT T T A A C AC T P oiino %>,o 
10117 ACTJICTTA'TTAGG'AAATATTGTC^ TC A TTT A A C A A C T T T A T T T A AC A G T .C ^ai^eq 

A A C T C T C T C T G A A T A T C G A A A A G A G T A C A A T A G T C T A C C AA T C..T A AT C G T M a w,* v 



10310 10320 10330 10340 10350



10168 AACTCT C T C T G A A T A T C G A A A A G A G T A C A A T AGTC TACCAATCTAATPPT ^hq 
1016T A A C TC T C T C.T G A AT A T C"G AAA AG A GTACAATA GT CTACCAATCTAATCfiT' n^aiCeq 
A c A A w isiai km ^ ca CC ca 6 gtticctt f rr ^; —

a0 ? 1Q 1Q42Q 10430 10440 10450 

J^S t c C T T T T A A A A t t 2603 -ai2. S eq 

10267 ACAACTTTTAAAATTTTTCATCACGAATCATTCAGCCACGTTACCTTTAA ne^ai^eq 

CTTACTTTAGTGAAAAATTTGGACT ATCTTGTGCA ACTGCTTATCG CATA Majority 
, 10 f7° ' lOjSO t0 490 10500 

TtTTITtTI^^ 2603,312.3^ 
10317 CTTACTTTAGTCAAAAATTT.GGACTATCTTGTCCAACTGCTTATCGCATA nem316_al2Tseq 

CGAAAACAT ATTAGT CCGTTACTAGAAAAACTTCGATTTCAGATTTTCAA Majority 
„ 1Q ? 10 1Q f 20 10530 10540 10550 

!35 rr AA A A n^ A ^ A ^ TC CH T TACTACAAAAKTTGGATtTUGAT T |8 T C A A 2603 aiZlseq 
10367 CCAAAACATA.TTAGTCCGTTACTAGAAAAACTTGCAT.T tQa GATTTTCAA nendTe.alSLq 

1 AAATAC TATTACCGGTGA C GAGTATCGAATTCGCTATTTAATCGCATTTT Majority 
" 10560 10570 10580 10590 ' 10600 

10417 A A A T A C T A T T A C C G G T G A C G A G T A T C G A A T T C G C- T- A-T T T A A T C G C. A EE T T nem316_ai2.seq 

TAAATGCTCGAT T T CCTATACAAGTTTAT C CCTTGTCTAAGAfGGATAAA Majority 
'_ 10 ? 10 IQg^Q ; 10630 10640 10650 

TYYYtJJTt^^ 2603_ai2.s^ 
10467 T A A A T G C T C G A T T T G G T A T A G A A G T T T A T C C C T TBT C T A A G A T G G A T A A A ne^!6_at2?seq 

. TTGCTTAT C A A ACCATTGTTATTAG A ATACTC AACTACTTTTAC T G C T T C Majority 
, ^fo 0 10670 10680 1O690 10700 

\ot\r T T G C T^T ATr^ A ' A pr A xTrT^- A Tnr A ^ A ^ ^ T^ AA £TAC T TT T AC T G.C T T C 2603_ai2.seq 
10517 TTGCTTAT C AAA C. GAT T G TTATTAGAATACTCAACTACTTTTACTGCTTC nea316_ai2^eq 

T C A T T A C T T C C C A A A T A C A T T T A T T TT CTTTGATACAfTGTTGTCTCTAT Majority 
. 107 *° 10720 10730 • 1O740 10750 

•t2S TITTTIinT^ 2603._al2.seq 
10567 TCATTACTTCCCAAATACATTTATTTTCTTTGATACA.T tQ T T GTCTCTAT nem316_.al2.seq 

CAT. CGA AACGTATT AATTATAATGTAGTTGTCCCTTACTCATCCCTTTTC Majority 
-, , M? 60 10770 10780 IO790 10800 

IoIIt C aIcc t a tS?V. A SI A A ATGT AGTTGTCCCT TAC TC AJTCCCTTTTC 2603_ai2.seq 

10617 CATGGAAACGTATT-AATTATAATGTAGTTGTC C'C T T A C T C A|] C CCT.TTTC n«i»316^iHeq 

ATTGAACTACAAAATATATTTATCT' A TGATACATTACAATATTGTGTCAA Majority 

10 ? 10 . 10820 10830 10840 "To850 

|0668 J^J^CTA AAAATATATT 2603._ai2.seq 
10667 ATTGAACTACAAAATATATTTATCTATGATACATTACAATATTGTCTCAA WlllU-Slseq 

A A A T G T T AT TATA G ATT C C T T T A A A ATT A.A TTT A A A A A A A G ACG AT A T A G Majority 
. rt? 60 - • 10870, 1088 0 10890 10900 

]°Z\* A A A T C T t A J T A T A G A T T C C T T T A A A ATT A A T T T A A A A A A A G A C G A T A T A G 2603 ai2.seq 
:10717 A A AT G T. T AT T AT.AG ATTCCTTT A A A.A T T A AT T T-A A A A AAA G.A Cfc'A.t A T A GhKS, 

ACT AT A T TT T TCT TGC TT A C G T T AC tTCCU T A A t T C T T T t T C T A A T C C XI Majority 
' .' . 10 ? IQ , •■ ;, ■ ^20 1093i0 . 10940 . 10950 

1 ^ 76S AC T A TAT TTTTCT TG C T.'T A C C T T A CTfCC C ATAACTCTTtTTCTAATCcl 2<Htt <^ 
10767 A C TATATTTTT.CTTCCTfACOn.C.TTC;CCATUCTCU 

A A T T G C A C T G A C A A G C G T A T C G A T A A T G.T A A T A G C T A T T T T C G A A A A T T A Majority 
. ^ -10960 .0970 , 10980 10990 : 1.1000 

Jn«l* A A 11 t * A ° A * G C.6TATCGATAATGTAATAGCTA T T TTCCA A A ATT A 2603 a^lseq 
10817 AATTGGAC.TGAGAAGCGTATCGATAATGTAATAGCTAT.TTTCGAAAATTA n^G^^Lq - 

TCCCAAATTCCAAAAATTATTACA GCCACTCAAAGATGCTCTTCCCTTAT Majority 

- 11010 11020 U030 • 11040 11050 

J0868 TCCCAAAT.TCCAAAAATTATTACAGCCACTCAAAGATGCTCTTCCCTTAT 2603 aI2 sea 
10867 T C C C A A A T T C C A A A A A T T A T T A C A G C C A C T C A A A G A T G CBC T T C C C T T A T nea3T6_ai 2 .seq 
%g££3r W A-qy rajai-a CS* a % r x j c t a t c a t t t t t t t c c r. a u T, Majority

11060 11070 11080 11090 11100



i ft I C A T C AUG AGTTGCTAAAAGTTGCTATCATTTTTTnicCGA ne «a I e_al2^eq 




TTCATCGAACTATC A X A C A C A X C A X A A A X X A A C T.A r T A X X C X C C C A C r r. A 

11160 11170 11180 11190 11200



TAATTACAUnCGTTAAGT fi AAA.nr.U6AAT A ^r , nTAACr.nrM 

n ? 10 "?20 . U23Q H240 ~ 



11220 
L 



11230 

_L 



T 

11250 
1_ 






11117 C A 
11117 C A 



11260 11270 11280 11290 11300



TCCT'CCGATA CAGATAGCC G T AC T AACTACACA.CTTTATTA 

U310 U320 ~* 

J — i 



AT A AC'C AA'A Majority 

11350 

__L 



' , "? Z0 . »?30 „340 U3S0 

- T TTTAACA C AATGTTTATTACACACGT.TTTCTTnT AAACAr.ATTn A t.tx. fej™ 
. "360 „370 n380 u890 r ~ 

? ? ? I M g * £ * f I S" T -I A * C B T T T T C T T C T A A aT *- 



11400 



11217' TTTTAACAGAAT gTt t A iH* C A G A G™X T'X X 6" X X C X A A A <?A GATTCATTTC ne^76'al2?seq 
. ■ C " Ct ""T'""TT» ACTGA.TCATCTTTCCAATATT A CTA ATGTTA ■, KaJorlty 

- "? 10 »»30 n^o " 77 4 - 50 . 




1 1 460 1 1470 

1 1-317 TCCAGAOATTATTATTACCA A-TH Ba AAGCTTTrtWrrTTiiTr a , , ^ 

ii317tcca G ataxxattaxxa C c.aa|^a^ A ^^TI1?|IT|Ta^ 

AG.ATXTCXXCACACAGTTTAA XXACAT A T ATXGATTXGATTCAXAr.TTeA UaJorlty 

- . - - . * - - •-*.." 



G A CC Afi'ATCA'Af C AA AK 0 A AC A A AT T A IT T C A t f. A A T AG A C- C A AG *JA * A Bajorlty 

11560 11570 11580 11590 11600



11417 G A C C AB A T C A" AT C A A AT C C A AfclA A A T T A T T T C A T C A A T a r a p r a a V a a , V ^ 
11417 G A C C A G AT C A* A T C A - A A^ TCC A A^ A A : A'T TA T T T C A T C A A T A:C .AjQ G A A G" A A .A A ne^T6_*.al2^eq 

• ATATTGTAAACTTTTTGCA A A A A C T A A T G A A A T A- A C T A C T C G T A G P T r ^ r ^ OPlty 

11610 11620 11630 11640 11650



ATAA.C-TCXXAAAAAXTAACAXXAAAAAGn -rACAGCATXGXGXAATr.r T r-r Majority 

11670 • " "680 . . 11690 H700 



18 
A C C T T



^ gjg^ I g gg Tj-t g »l T il IP A T A T A ATCCAACTTT TCA ACTC Hz, ^ 



11710 11720 11730 11740 11750



I1S67 A G C T T T T ^^^ ATTTTTTTGAATAA ^ ATAATC CAA " TTT CAACTG 2603 a!2 sea 

1 1567 AGCTTTTTAATGTTAATTTTTTTCAATAATATAATCCAACTTTTCAACTC neJl^a i ^seq 

TTTTTTC CC ATCTG A A ATCTTCTTT A ATTC TTTT ACQ A AT ATTC TGTTGT Majority 



11760 11770 11780 11790 11800



11617 TTTTTTCCCATGTGAAATGTTCTTTAATTCTTTTACCAATATTCTGTTGT neiSlelai^seq 
AGTTTCTCTCTT AATGC CT T.A TCTTTTACTAAT A A ATCAAGAGA T.T C A T G Bajortty 



11810 11820 11830 11840 11850



uSJ A G T^CT^CtIa*^ 2603_ai2.seq 
11667 AGTTTCTCTCTTAATGCCTTATCTTTTACTAATAAATCAAGAGATTCATG nein3l6_al2~seq 

CAGTCACTGAGTATTTTCTTCCATGATGA T TCCTAACTCAGGCCTATCAA Majority 



11860 11870 11880 11890 11900



11717 G A G T G A CTGAGTATTTTCTTCCATGATGATTCCTA ACTCAGG C P T A TP A 1 9^, 
11717 GAGTCACTCAGTATTTTCTTCCATCATCAT^^ 

TAACT-TCAACTGTTCCACCGCGA T C TCTTC C AAT A A TA CCACTTGAAAGT M a wi, v 
- U ? 10 U ? 20 ] H?30 11940 ^ 11950 - 

UlVl VkkIVAkKcI^^^ 2603_ai2.seq 
11767 TAACTTCAACTGTTCCACCGCGAT C T CTTCCAATAATAGCACTTC A AACT ne^l6_ai2!seq 

AGACCAGCTTCTAAAATAGAGGTTGG T AATCCCTCTGGA T A C A T T G A A C G Majority 
. t1 ? 60 u ?70 U980 11990 12000 

lim AC A C C A C f T ? r T A a A ^x A ^a "/» TTGGTAATC. CCTCTGGATAC ATTG A A G G 2603_ai2.se q 
11817 AGACCAG.CTTCTAAAATAGAGGTTGGTAATCCCTCTGGATAC ATTGAAGG neaSlSjaiZL, 

GTAAACAAAGATATCAGTCTG.TGCCATT AAAGACATAGTCTGTTCAAAGT llaiorirv 
. A2 ? 1Q ^020 . 12030 12040 12050 

\\ltl 5 I A A A '? A A A J A J A Jjj ACT C T G T G C C A T T A A AG A C ATAGTC TG t TC A AA-GT 2603 ai2.seq 
.11867 CTAAACAAAGATATCAGTCTGTGCGATTA A AG A C A T A G T C T G T T C A A A G T ne^Te^lHeq 

TTAATTTCCCCAAAAACTTAATCTGTT T GG.A CTGATATTT.CTCTTTCAA A Majority

12060 12070 12080 12090 12100



Wlli T T A A T T TCC.fr AAAAA ?H AA .I"5"««*CTCAIAinCTCIITCA A i. 2603_ J5 l2. Seq 
11917 T.TAATTTCCCCAAAAAGTTAATCTGTTTGGACTGATATTTCTCTTTCAAA nea316_ai2!seq 

TGTQCTAATTCAGGTCCGTCTCCTGCA A TCTGTAAATAAACATTTTC.AGA. Majority 
JfJ^ 0 . ^2120 12130 12140 12150 

U967 TGTGCTAATTCAGGTCCGTCTC, CTGCAATCTGTAAAT.AAACAT T*T T C A G A ne*3l6_ai2.seq 



GT ACTGTG A.C AT'CG A AAATGCT T. C T A A G A G G A.ATTC AATG.C C TTTTTC.TT Majority 
. ffjfg, ., t2 P° . X2180 12190 . 12200 

S£ * G T A C T G T" A Pi " A T' P ^ A 1^ J a^ V j^r* «r «!p 1^ S A ^' A A ^* ^ ^* A. A T G C C T T T T T C T T 2603lal2.se* 

12017 G T A C T GTG.AC ATCflAAA AT G„C TTC -T A AG A G C A A T T C A A T G C C T TT ? TC T -T* h^i6_al 2 I?eq 



* : - T A A T A A'T tC T A C C A G C A T A A CT-C A T G A A A A UTC A T-C AG C A G A t. T t T T C A Majority 

12210 12220 12230 12240 12250



S T A A T A AT t'c't l^ri r pat a^ a S ^ A A TC. A GC AGATTTTIC A 2603_at2.sei| 

12067 T A A T A A T T CT A C C A G C A T A A G T G A T G A A A A T A T C A T C A G C A G A T T T T T C A ne«316>i23eq 



A G G T A-A GCCGTACC AG CAAAATCAG A G C C TAG ACTTTCAfiiT ACCC A ATT Majority 

12260 12270 12280 12290 12300



■ma ir'fr! ac r r > I A S S A t n A k . A A T C A G A G G c T A G ""nmriccctiti 2603_ aI 2. seq 

12117 AG G T A A G C C.G T A C C A G C A A A A T C A G A G C C TAG AC TTT C A GA T AC C G A A T T ne*316_alf?se<j 

ATAAATAACTCC TTTA GCTTCfATATTAAAATCTTTTAACCATTCAACCC M a io r l«:v 
- 12 ? 10 »2320 12330 12340 . 12350 

\D" A J AAA ? AAG I GGTTTAGCT ^ CTATATTAAAATCT TTTAACCATTCAACGC 2603 at2.sen 
12167 ATAAATAACTCCTTTAGCTTCTATATTAAAATGTTTTAACCATTCAACGC nemsTe.a^^eq 



Jg£ 1 1 5 I £ I ™GATACCCCATAAAAATCTGCACGATAATGCTTAACACGCGCT 2603 aI2.se, 
12217 TTCTCTTGGATACCGCATAAAAATCTGGACGATAATGCTTAACACGCGCT ^l^lTLq 




Jf?!! Hr A r AAGA I C II^ ATAGJlTAGCTCCAAACAAA TCTAAAAAACGATTATT 2603 at2 sea 
12267 GTGACAAGATGTTCATAGATAGCTCCAAAGAAATCTAAAAAACGATTATT .H^IsIaii^L 

GACAGAAAAATGACTTCACCCATG-CTCT A AAACAATACTACGTAAATGGT Majority 



12460 12470 12480 12490 12500



ra£ C r £ r a" a AAA ™ A CTTGACCCATGGTCTAAAACAATACTAGGTAAATGGT 2603 ai2,seq 
12317 GACAGA-AAAATGACTTGACCCATGGTCTAAAACAATAGTAGGTAAATGGT ne^a^ec, 

CATTCTTTGCAAAAGATACCCCTTCTAG C CTTGTTAACTGAAAACCTGTA Majority 



12510 12520 12530 12540 12550



12367 GATTCTTTGCAAAAGATAGCCCTTCTAGCGTTGTTAACTGAAAACGTGTA ne^l6_aiHeq 
TTACAAATCAGAAAATCAATATTTTCA T CTGAAACATATTTCATCAGCGT Majority 



12560 12570 12580 12590 12600



1*242 T T A C A A A TrAr AA AATr AA Tr ATTTTCAT C GAAACATATTTCATCAGC'GT 2603_ai2.seo; 

12417 TTACAAATCACAAAATCAATATTTTCAT.CTGAAACATATTTCATCACCGT ne^lG^iHeq 

G T TCTA T T C TCCAT T-TT TCTTA ATAATAGGATAGCGCTGCTTCACAATAT Majority 

12610 12620 12630 12640 12650



C T T G T 'A T T d ?Sr J A I A ATAGGATAGCGCTGCTTGACAATAT 2603_ai2.seq 

12467 GTTGTATTCTCGATTTTTGT T A ATAATA.GGATAGCCCTGC.TTGACAATAT' nem3l6_ai2^eq 

TTTTGGTCG GTAA ACG CTAAA T T TTTCTACCCTTG f C T TC A TCTATAATC Majority 
12660 . 1267 ° 1268Q 12690- 12700 

12517 T T T T G G T C G G T A A A C G C T A A A T T T T T C T A C C G T T G T C T T C A TCTATAATC ne»316_jU2.seq 

GCTAAATC ACC ATG ATTACTTGTT AC AATA AC AACACCCTA GCC'AC GCTT Majority 
. 12 ? 10 I2 ? 2Q 1273Q 12740 12750 

l^tl II^HTT^^ 2603_ai2.seq 
12567 GGTAAATCACCATGATTAGTTGTTACAATAACAACACGGTAGCCACGCTT ne*3l6_ai2.seq 

A ACC AA A TCT C C-TG TC AT T T T ATC T GT A T A AC C TTC A AT AC C TC C G AC G A Majority 



12760 12770 12780 12790 12800



SSI A A ^ CAAA I C I GCTGTCATTTTATCTC ' rA TAAC GTTCAAT ^ CCTCC ^ 2603 al 2.seq 

12617 AACCAAATCTCCTGTCATTTTATCTGTATAACGTTCAATACCTCCGAGGA nenaleLai^L, 

AGCCTAC AT A A T A TCCTGA GAA.AACACCAACTGTT.TTTACCTTATTTTCC Majority 

. 12810 . . * 2 ?Z0 , ■ , 12830 ,12840 ■ . • 12850 . 

12667 AGGCTAGATXATATCCTGAGAAAACAGCAACTGTT T T T A C C T T A T T T T C C 2603 al2 sea 
12667 A O 0-6 T A G A T AA T A T CC T G A G A A A A C A G C A A C T OT.TTTt A.C C T T A f t T T. C £ Slai^L 



A TATT TAT C C A CTT TC AT C A A T A A G.C C A TCT TTTA AG C C$XJT AA TC AT A>, Majority 

12860 12870 12880 12890 12900



A I A 1 1 1 A I ^ C A C T T T C A T C A A : T A A G C C * T C t T T T A A G C Q:t T.T A A t C A T A 0 2603^,12 ied " 
.12717 ATATTTATCCACTTTCATCA A.TAAGCC A T C T T T T A A G C C T T T A A T C A T A G ^wjif^eq 

CAACTATTTTT TfC CTCTTT K.CT-C T T C T C C T A C C A A C A C T C G A A CA A A T Majority 



12910 12920 12930 12940 12950



IH« £ A A SI A J II I I T TCCI - CTTTT C/C-T C T T C T, 6 CTACC. AA'CACTCGAACAAAT 2603 a 12 .seq 
.12767 CAACTATTXTTT. TGCT'CTTTTG. CTC.TTCTGCTACCAACAC.TCG;AACAAA T ' nem316_al2?seq 

* • * 

TCATTTfrGCATAAATACTAAATATTTG T ATCCCTTCTTCTTACC AT ATTT Majority 

12960 12970 12980 12990 13000



12817 TCATTTCGCA T A" A A TACT A A AT ATTTCT ATCC CTTCTTCTT ACC ATA-TTT 2603 a!2 ^ 
12817 TCATTTCGCATAAATACTAAATATTTGTAT CC CTTCTTCT TA CCATATTT ne^t6_ai2^eq 
T T T T P ffW tfilSi^^ 3* gi "^'ffi A T ^^"^ ATATTTTCGAAATCGTC Majority
r3 ? 10 13020 13O30 13040 13050 

!^IIII A I AA I A *^^ 2603_al2.seq 
12867 TTTTATAATATAGATCCCATTCCGTATCATGTAATATTTTCGAAATGGTG nem3I6_ai;Tseq 

AATCATTCAATACATCAA AAACATCCCCAAATTTTTTAACTCCT-CAAGAG Maj orl ty 
13060 13070 13O80 13090 13100 

J 2917 AATGATTCAATACATGAAAAACATGGCCAAATTTTTTAACTCGTGAAGAG 2603 ai2 seq 
12917 AATGATTCAATACATGAAAAACATGGCCAAATTTTTTAACTCGTGAAGAG m^76_ai2!seq 

TCTCCAATTTCCTGTAA CACACCAATAAAATTAACCTCATAA-GTCTTATA Majority 
• 13 1 10 *3j20 13130 13140 13150 

If™ TGTCCAATTTCGTGTAACAGACCAATAAAATTAACCTGATAAGTCTTAT A 2603 ai2.seq 
12967 TGTCC AATTTCGTGTAAGAGACC AATAAAATTAACCTGATAAGTCTTATA nemsle.al^eq 

TCCCATCTCTCACACACCAT AATTCAT T.T CAGAGTC. AACAAAATCAATAA Majority 
13 . 160 13170 13180 13190 13200 

)™\Z TCCCATCTCTGACAGACGATAATTCATTTCAGAGTCAACAAAATCAATAA 2603 a!2.seq 
13017 TCCCATCTCTGACAGACGATAATTCATTTCAGAGTCAACAAAATCAATAA ae^W^iZseq 

ACATCTCTTCTCCAAAGC CACATGTTTCTTCGAAAACGCTCGTTTTCATT Majority 
^_ 13210 1322Q , 13230 13240 13250 

13067 A. C„ A T'C TCTTCTGCAAAGCCAGATCTTTCTT'CGAAAACCCTCGTTTTCATT 2603 ai2 seq 
13067 ACATCTCTTCTGCAAAGCCACATGTTTCTTCGAAAACGCTCGTTTTCATT nemsTe^ai^Tseq 

AAACCAGCG C A A G T A AT A C A C T C T T, C A A T T T C T T t A T A G T C A A A TTCTTG Majority 
13260 13270- 13280 13290 13300 

J!?!! ;^5 C ^ CCC ^CTAATACACTCTTCAATtTCTTTATAGTCAAATTCTTG 2603 a 12. seq 
13U7 A A A G C AG CCCAAGTAATACACTCTTCAATTTCTTTATAGTCA AATTCTTG neaSie^alSTseq 

CATCA CTAAATTTTCACCGTTCATATCTTCATACAAACAAGATAACATAC Majority- 

13310 , 13 320 13330 . . 13340 13350 

J 316 ! ft I? A S TAAATT ^ TCACGGTTCATATCTtGATACAAA P A A-C.A T A C 2603 ai2,seq 

13167 CATC ACTA.AATTTTCAC.GGTTC ATA TCTTGATACAA ACAAGATA ACATAC ne^Te^aiHeq 

CGA'CCTTAGCTAAATGAAG GTAATTTTCATAATTATCtATC.AAATCACCT Majority 
. 13360 13370 , 13380 , 13390 13400 

13217 CGACCTTAGGTAAATGAAGGTAATTTTCATAATTATCTATCAAATCACCT 2603 aiZ.seq 
13217 CGACCTTAGGTAAATGAAGGTAATTTTCATAATTATCTATCAAATCACCT ne«316„ai2!seq 

AGGACAACCGAATCTTGA T.CTAAACTCAAG AACCAATCAUTTCTTGTGC Majority 
13410 13420 13430 13440 \ 13450 

13267 AGCACAACCGAATCTTGAttTAAAGTCAAGAACCAATCAAATTCTTGTGC 2603_ai2 seq 
13267 A G G A C A A C C G A A T C T T G A T C T A A A G T C A A G A A C C.A A T C A A A T T C T T G T G G nemSie.al 2. seq 

TACTGCAAATT GACCGATACAGTTCAAAGC ATATGC AATCCCTTTATTTT Majority 

13460 , . 1 34 7Q .' ' 13480; . I 1 34 9° 13500' 

13317 T A C I G C A A ATT-G A C C G AT A C.A G T T'C A A A G C.A TA'.TGC AATCCCTTTATTTT 2603 ai2 seq 
13317 T A-C .T-G C.A .A A.TTG A.C C G A T A G.A.GTT C.'A-A A G,G ATA T G C -A-A-T C'C' C'T- T -T*A T*T T .T ne^is ai2^eq 

C T G.t T A A AT A AT.C A ACAGTT A G. G T G CC C V T CTT C A X T A T A A T C G GCt AC X Majority . V 

* 3 ? 10 ,13526 . 13530 13540 13SS0 

13367 C T G T X AAA T A A T C A A CA G T T A G G T G C C C G TGTTCATT A T A A TC G G C TACT 2603 4*12 seq 
13367 C T G T T A A A T A A T C A A C A G T T A G G T G C 6 C C T C T T C A T T A T A A T C G G C t A C T ne^Tfi^l^seq 

A A T T G A G A A A T T T C T T C C T T A T T T T T C G A G C C A T T A T C T A C C A T. A T A G A T Majority 

- . . * . . ... 1356Q . 13570 13S80 . 13590 13600 _ " 

13417 A A T TG A G A. A. A J T T C T T C C T.T ATT T f T C G A G C C. A T T A T C T A C G A T A T A G A T -2603_jU2: S eq 
13417 AATTGAGAAATTTCTTCCTTATTTTTCGAGCCATTATC.TACG ATA TAG AT nem3 I 6_al2!seq . 

C TCCCTTACTTGACGATAAATTCCTCC AATGTTCTGATCTAAGCGTTCA A Ma tori tv 
J 13610 ■ , 13620 13630 13640 13650 

13467 GTGGCTTACTTGAGGATAAATTGCTCGAATGTTCTGATCT'AAGCGTTC A A 2603 ai2 seq 
13467 GTGGCTTACTTGAGGATAAATTGCTCCAATGTTCTGATCTAAGCGTTCAA ne m 3Te_ai2 seq 
T A T T G g CCTT A A B A C A A T A C C CGCTA AATA TTTCATGTTCTATGCT Majority

TiTrlTITT^^ 2603^ai2.seq 
13SI7 TATTGGGGTTAAAGGTGACAATACCCGCTAAATATTTCATGTTCTATGCT ne ffl 3 I6 _aI2^eq 

CTTTTCTAAAA TCTC TAAATAACTCAATGACTGGTGCTTTGGTTATAAAA Majority 
13 7*° * 3 720 13730 13740 13750 

13567 CTTTTCTAAAATCTCTAAATAACTGAATCACTGGTGCTTTCGTTATAAAA nem316_ai2.seq 

ACGATACCGACATAGAT A CTTACTCCTACTAAACTTTCAATGACATAATT Majority 
• ; • l3 '{ G0 '3770 13780 " 13790 13800 

13617 ACGATACCGACATAGATAGTTACTGCTACTAAACTTTGAATGACATAATT 2603 ai2 sea 
13617 ACGATACCCAUTAGATACTTACTCCTACTAAACTTTGAATGACATUn £3iK^ 

TACCAAT GATACTCACATTTGAGTATTGATATAATAGAGTACACCTCCAC Majority 
, 13810 , 13820 13830 13840 13850 

™fl TACPAATGATACTGACATT-TGAGTATTGATATAATAGAGTACAGCTCCAC 2603 ai2 seq 
13667 TACCAATGATACTGACATTTGAGTATTGAT AT AAT'AGAGTACAGCtCCAC ne^Te.aSeq 

TA AG A CTAGCAGCAATTAAATACCGCAGCATTCCTCTTCTTAATTCTTTA Majority 
'* 13860 1387Q 13880 13890 . 13900 

*? 717 TAAGAGTACCAGCAATTAAATAGCGCACCATTCCTCTTGTTAATTCTTTA 2603 ai2 sea 
13717 TA AC AGTAGC AGC'A ATTA AATAGCGCAGGATTCCTCTTGTTAATTCTTTA ae-IlK^ 

AAAGTA AATACATCT .CTTAAAGAGATAGCTTGATATAGGGAGACAATAAA Maforitv 
■ • . 13 ? l ° 13920 13930 13940 13950 

«™ * T A C A T C T C T T A A A G A G A T A G C T T G A T A T A C G G A G A C A A T A A A 2603 al2.seq 

.13767 AA ACTA AATAC ATCTCTTA AAGAG ATAGCTTGATA.TAGGGAGAC AATAAA nem3l6_ai2^eq 

T f C A C T A A T A A C T G T A G A G A T A A T A G C T C C C A T A G C A C C T A A A A T T G G T A Major I ty 
•.- * 3 ? 60 13970 • 13980. ' ; 139 90 14000 

\™\Z IICAGTAATAACTGTAGAGATAATAG C T C CCATAGCA C C T A A A A T T G G T A 2603 ai2.seq 
13817 TTCAGTAATAACTGTACAGATAATAGCTCCCATACCACCTAAAATTGG T A nemsTelai^eq 

;TTAAAAGTATATTAAGCACA ACATTTGCCA.CAAGTCCAATAACTGCAGAC Majority 
, 140 *° ^020 14030 14040 14050 

J 3 * 67 TTAAAAGTAT^ATTAAGCACAACATTTGCCACAAGTCCAATAACTGCAGAC 2603 ai2 sea 
13867 TTAAAAGTATATTAAGCACAACATTTGCCACAAGTCCAATAACTGCAGAC ne^ai^eq 

ATTGTGTAAGCTTTTGTACGTCTTGAAGCC.AGTAGATACTGTGTCCCTAA Majority 
. 14060 14070 14080 14090 14100 

ATTCTGTAAGCTTTTGTACGTCTTGAAGCCAGTAGAIACTGTCTCCeTAA- 2603 aiZ.seq 
13917 ATTGTGTA AG CTTT'TGTACGTCTTG A AGCC ACT AG AT ACT GTGTCCCTAA nem316_ail!L, 

- AGCGTTACCATA AGAAATGCAAATGATCATCAAA Majority 
14110 . . • .14120 14130 

13967 A G.C GT.T.ACCATA AG A A A T G C A A A T G A T C A TC A A A 2603 ai2 sec 

13967.AGCGTTACC A T A A G A A A T G C AA ATGATC ATCAAA he^TiS.aiHeq 

Decoration: Shade (with solid black) residues that differ from the Consensus.
with Weighted residue weight table.



PCT/US2005/027239



1 
1 



51 
51 



10 1 
101 



151 
151 



201 
201 



251 
.251 



301 
301 



351 
351 



401 
401 



451 
451 



10 20 30 40 50



T C C A C A T C C rTrrA^T^^^^^ TATG ^ CGT A C C AGTAATTCCC oohl a!2 sea 

TCCACATCCGTCCAATTAACATATGACCTGGCGCATCACCAUAATTCGG Sofl^t^ 




A AT J^^C^^TGA^T^r ^ f ^ ^ A A A C G A C C T T C 1 T. G G T A A T C T C cohl.a^seq 
AATATAGTCTTCTTGACTTAACAAACGTAAACGACCTTCATCGTAATCTC a909_a!2.ieq 

TCTGCATTCTTGTATTAGTCATAAGATGCAGA ACGTGTAATTTTATACCr Majority 

_ l 7> '?<> 190 200 . 

IgI^ A II^ TTGTATTA HTCATAAGATGCAGAA GGTGTAATTTTATACCC cohl^.seq 



«TCCATTCTTCT*TIACTCATlACATCaCAA fi Siii A Iiii}I};^^;q 




ATGTGTCTCCCCTGC.CAGACCATTTATTACCTGAC AAACf ATTTTTif TT *,« 

A T G T C T r T rrr «r r» r» /• . - - «^ « . A. A o U A T T T C T A- C T T cohl__ai3.seq 

ATGTGTCTCCCCTGGGAG AC CATTTATTAGGTGAGAAACGATTTCT'ACTT a909_al2:seq 



TAGCAGCTAATTCTCGTATTCTCTTAACAG T T T T T T T C T A A A G C T C A T A T Majority 

310 320 330 340 350



T A G G A G C T A A T T CTCG TAfltC TCTTAA.C A GTTTTTTTGTA AAGGTrA TA I '^i ,o 

? a **agctaA TXC tcgtattct^ 

GAAT.GTGCTCTATTTAf TAATGCAGAACTT G CT-TCATAAGTTGTCTGAAr. Majority 



400 
JL 



GA ATGTG CTCTATTTATTAATrrArAA^ cohl^sea 
GAATGTGCTCTATTTATTAATGCAGAAGTT'GCTTCATAAGTTGTCTGAAG a909_al2.seq 




A T A ? A It ?^CATP'Ar^ A A?r M^CGGC TTCTTC€GAtGTT : GA.Teeci ^hl^a^.seq 
AT- A.T.AS-T T TC AT C A G-G T A AG CA A T ; C C G G CC T T G TT C CGATGT T G A TC C C.G a909Xal2 Jstq 

- A T.A. A CT C C T G GCK AT T AA IAGCC TGTT € G T A A C G C T C T T T A A T T A T C TV. if.m^ 

"•- #P : / 520 . . . -.'son - *\« . . *~ * 



501 
501 



551 
551 



T A A C T TA G C A T G G G T A T T G G T A A A.ATTTT GA AAATAGACTAAGTATTTAT Majority . 

560 570 580 590 600



601 
601 




TAACCTCAGGCCACTTTCTATGC ATGAAATCAATTTCTTTATAGA ATTGT a9Q9_alz:Seq 



with Weighted residue weight table.
T C A C <^Agy y g^,^ y^gtfy TAGCATCCCCTGAACCAOAAAC Majority

6( P JJ80 690 700 



651 
651 



701 
701 



751 
751 



SOI 
801 



851 
851 



TCA.CCAATAGGAGCTTCTGGAGCAACTATAGCATCCCCTGAACCAGAAAC a909^ai2.seq 

TGTCCAAA AAGTCC ACCCTCCTCTACCAACTGTTCCATCTCTCTTAGGAC Majority 
7 j Q ™ . 73Q 740 750 

^r?^rwM^^ CACCCTCCTCTACCAACTGTTCCATCTCTGTTAGCA ai2.seq 
TGTGCAAAAAGTGCACCCTCCTCTAGCAACTGTTCCATCTCTGTT.AGGAC a909lai2.seq 

ACTCAAAACCAGCATCTA TAGGTAATTTAAATATTTTTTCTCCAAAGAGT Malorltv 
, • 32! 77 .° 780 790 800 

AGTeAAAACCAGCATCTATAGGTAATTTAAATATTTTTTCTCCAAAGAGT a909lal2_seq 

TCTCCA TAATAAT CATTAATCGCACGATAACGTTTTTTCATACGATAATT Malorltv 
. - 8 j: Q **20 830 840 . 850 

J^^ ATAATAATCATTAATCGCACGATAACGTTTTTT CATAGCATAATT cohl al2 seq 
TCTCGATAATAATCArTAATCGCACGATAACGTTTTTTCATAGGATAATT a909_aI2.seg 

GTATCACAA TTTTAAC TAAAATAACCTCACTACTACAATAAAACTAAAAA Majority 
8 f> ' 870 880 890 900 

^ -r ^ A |J AA T ^ TAA *' TAAAA TAACCTC A c TACTAC AATAAAACTA AAAA cohl a i2.seq 

GTATCACAATTTTAACTAAAATAACCTCACTACTACAATAAAACTAAAAA a909lai2.seq 

• - 

AGATTGCAACGTCAG T'T A GTTC C AATCTTTT ATTTA CT TCACTTTCTTTA Majority 



910 920 930 



901 
901 



940 950 



951 
951 



„AGATTGGAACGTCAG'TTAG TQ CCAATCTTTTATTTACTTCAGTTTCTTTA cohl ai2 seq 
AGATTGGAACGICAGJTA GTT- CC AATCTTTTATTTACTTCAC TTTCTTT A a909_aI2lseq 

A CG A- ATCCTTGGCTAAAAA GAT. AT ACGCA.GTTAG ATT CAAAATACCATAA Majority 

960 970 * 980 990 1000 
' ' ^ „wM"jr . — : — •• rf — i — : 1 • , t 



.1051 
1051 



A C ClA TCCTTG GUT A A A A AG A T A TA C G C A G T T A G A T T C A.A A A T A C C A T A A cohi- al2.seg 
AC.CAATCC.TTGGCTAAAAAGATA. TAGGCAGTTAGATTCAAAATACCATAA a909lai2.seq 

CCAACTA.TAAAACCAGCT AAAACATCTGTCGCAAAATGAACCCCTACGTA Majority 
*q t0 1020 1030 1040 1050 

!™J ^ CAA SI ATAAAACCAGCTAAAACA T"GTCGGAAAATGAACCCCTAGGTAcohl a!2.seq 
10O1 GCAAGTATAAAACCAGCTAAAACATCTGTCGGAAAATGAACCCCTAGGTA a909~ai2.seq 

AATACGAGATAACCCA ATTAAAAAAATGAGCAAACCCAATGTACCTTGGC Majority 
^ 10 t 6Q 1Q70 1080 1090 1100 

AATACGAGATAACCCAATTAAAAAAATGAGCA.AACCCAATGTACCTTGGC cohl ai2 seq 
A A T A C G A G AT AACCC A A T T A AAA AAATGAGCAAACCCA aQDtA CCTTGGC a909_al2.~seq 

A C A A C A G T T T C C A T A T A C T C T T A G G C A T A T A G T A G T G C A A T A A A A T A A T A Malorltv 
•. • 1110 • 1120 1130 1140 1150 

* S M S A S T T ^ C€ A T A T A T T A . GG C A TAT A G T A C T G C A A T A A A A T A A T A cohl a!2.seq 

1J0I A C A AC A GTT T C C AT A TACT-C T-T A G G C AT A TACT ACT CCfA AT A A A ATA A T A a909"ai2.seq 

• . . w - 

C T A CT C € C A A AT AT- C A T A A A T G T t C € € AT : G .G.A G T G C CCA CT IcCCA A A CCA MaWltv 
- - fj£L ' H 8Q ' * 1190- 1200 : 

SiK' SlU A A T A T C A T A A A T e T T C C C A T C G ACT G C C C A CT G G G A A A C G A cohl alB.seq 

US* HT A C T C C,C A A A T A T C A T A A A T* G TTCCCATCGAGTGCCCACTGGGAAACGA a909lal2.seq 

- . A T A G C C A C C t G C A A. A T A C T A A A T G G G T T A A A C T T G G T C T C A C T C T T T G A. A Majority 

Jgg . 1220 1230 .1240 1250 

I20i A TAG C C A -C CT G C AAA TACT AA A* T .G G GTTAAAGTTGGT'CTCACTCTTTGAA cohl ai2 seq 
1201 AT ACC'C A.CCTGC AA'ATA C T A AA T GCCTTAA AGTTCGTCTC ACTCTTTGAA a909_ai2[seq 

A A A T A AG TTTT A A A GA A AGTATACATA.TA CC AGAGATA ATA GCA TTT A CT Majority 
. 1260 1270 . 1280 1290 1300 

1251 AAATA AGTTTTAAAGAAAGTATA CA TATACCAGAGATAATAGCATTTACT cohI_aI2.seq 
1251 AAATAAGTTTTAAAGA A. A GTATACATATACCAGAG AT A ATAGCATTTACT a909_al2.seq 
jjji * ft I fejj pgj Cy TCTTAAGCTAACAGAAACTCK Majority

^tlO 1320 1330 1340 1350 

J^ 1 GCGATAAATCTAGCTTCAGGATACCACTTCTTAAGGTAACAGAAAGTGAC cohl al2 seq 
1301 GCGATAAA T C TAGCTTGAGGATACCACTTCTTAAGGTAACAGAAAGTG'AC agOQaiziseq 

GCTCATAATCGCAATAGC TATCTGGCTTACACTATTACCAACCACACTGA Majority 
1360 1370 1380 1390 1400

GCTCATAATCCCAATAGCTATCTGGCTTACAGT-ATTACCAACCACAG.TGA coM a!2 seq 

GCTCATAATCGCAATAGCTATCTGGCTTACAGTATTACCAACCACAGTGA aSOsIa^seq 


TTAACTTGAAAA ATCTTGTAGAAAGATTTCGCAACTGTCCTCTAACACTT Majority 

1410 1420 1430 1440 1450

1461 T T A A C T TB A AAAATCTTGTAGAAAGATTTGGCAACTGTCCTCTAACACT T cohl_ a i2.seq 






1401 TTAACTTGAAAA A.T CTTGTAGAAAGATTTGGCAACTGTCCTCTAACACTT a909_ai2;seq 

TCTTGAATAGTT TGGTCAAATGCGATTACAGTGTCGGGCCAATATTTGAT Majority 
1460 1470 1480 1490 1500

TCTTG AATAGTTTGGTC A A ATGCGATT ACAGTGTCGGGCCAATATTTGAT cohl ai2 seq 
T.CTTGAATAGTTTGGTCAAATGCGATTACAGTGTCGCGCCAATATTTGAT a909_.ai2.seq 

G A CCAATCCTAAACTGAAAAATAAGATAATAGCAA T.A A A TGCTTGAATAA Majority 
1510 1520 1530 1540 1550

CACCAATCCTAAACTGAAAAATAAGATAATAGCAATAAATGCTTGAATAA cbhl_al2 seq 
GACCAATCCTAAACTGAAAAATAA.GATAATAGCAATAAATGCTTGAATAA a909_al2.seq 

G T T T A C T A T T T T G A C G A G A T A A G A T T A G T C T T T T T A T A f C T T T C T A A T A T Majority 
1560 1570 1580 1590 1600

.1551 CTTTACTATTTTC.ACGA5ATAACATTACTCTTTTTATATCTTTCTAAT.AT cohl aI2 seq 
1551 GTTTAqTATTTTGACGAGATAACATTAGTCTTTTTATATCTTTCTAATAT a909_al2;seq 

TGGCAAACAAG CCACCTAAGTTAGA T.A GAAAACAATCGAAATTAAAATTC Majority 

> 1610 1620 1630 1840 1650 

1601 TCGCAAACAAGCCACGTAAGTTAGAT.A G A A AACAATCGAAATTAAAATTC cohl_al2 seq 
,1601 T G G C A A A C A A G C C A C G T A A G T T A G A T A G A A A A C A A T C G A A A T T A A A A T T C a909_ai2.seq 

CCTCAACGA T A TTAAATGGAATAACCATTCTTAAAAGGTAATTfiCCTA C-A Majority 
1660 1670 1680 1690 1700

1651 ;C C T C A A C G A T A T T A A A T CCA ATAACC.ATTGTT AAAAGGTA ATTGCCTACA cohl ai2 seq 
-1651 CCTCAACGATATTAAATGGA'ATAACCATTGTTAAAAGGTAATTGCCTACA a909_ai2!seq 

. CCAATAAATGTT CTGATATCAAAGTTAGCAAATATA.GCATACAAACCAAT Majority 

1710 1720 1730 1740 1750



1701 CCA A T A AATGTTCTGATATCAAAGTTAGCAAATATAGCATACAAAGGAAT cohl_.ai2.seq 
1701 CCAATAAATGTTCTGATATCAAAGTTAGCAAATATAGCATACAAAGGAAT a9,09_ai2.seq 

C G C A A AG A C A T A G T T C A G A G C T A C C A T A G A T A CA G T C A A GCTAACTGTAC Majority 

1760 1770 1780 1790 1800

.1751 C.C C A A AO AC'ATAGTT G A GAG C-T A C C A T A G A T A C A G T C A A G C T A A C T G T A C cohl_ai2.s_q 
175|* C.C C A A A G A C A T A G TT GAG -A C.C T A C CAT A G A T A C A G T C A A G C T A A C T G T A G a909^ai2.seq 

C A A AT A A A C T A G C T T-T.A A T,A; A A A T'CT T T T C C_A C T C TC T CUTTTTT C : C AG Majority . 

1810 1820 1830 1840 1850

1801 C A A A.T A A, A CT A GC TH A AT A A A A TC T TT T G C A C T CTCtCT ATT.TT TC C AG cohl ai2 seq 
1801 C A A A T A A A C T A G C T T T A 'A T A. A A A T G T T T TC C ACTC TC TCT AT TT T T -C.C A G a909^ai2]seq- 


. A A A A T A G C G A A A C TT G C T A A A A A T A.G AG C T A G A GC A A C C A T ATTCATCG G Malorltv 

1860 1870 1880 1890 1900

1851 A A A A T A G C G A A A C T T C.C. T : .A A A A A T A G A G C T A G A G C A A C C A T A T T C A T C.G G cohi_ai2 seq 
1851 A A A A T.A G C G A A A C T T G C T A A A A AT A G A C C TAGAGC AACCATATT CAT C G G a909_ai2.seq 

. XAAACCGATAA AGGTTTCTGGACCACGATTAGCAAGTATAACTTTTAAAA Majority 






1910 1920 1930 1940 1950

TAAACCGATAAACGTTTCTGGACCACGATTAGCAAGTATAACTTTTAAAA cohl_ai2.seq 
TAAACCGAT A A A G GTTTCTQ GACCACGATTAGCAAGTATAACTTTTAAAA a909_al2.seq 
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GTGk T P1:5 1V gM TSATTTCAAATCAAATA A __A__A Uajorlty 






1960 1970 1980 1990 2000

G T G A T C T T A A T A A C A C T A C A C C A T A A C T T G A T T T C A A A T C A A A T A A A A T A cohl_ai2.seq 
GTG ATCTTAA TAAGAGTACACCATAACTTGATTTCA A-A TCAAATAAAATA a909__al2.seq 

AAACCAACTAACATCGCA ACGATTCAAAAATCAACCTTTAAAAATTCTCC Majority 
2010 2020 2030 2040 2050

AAAGC^AACTAACATCGGAAGGATTGAAAAATCAACCTTTAAAAATTCTGC cohl_a!2 seq 
AAAGCAACTAACATCCGAAGGATTGAAAAATCAACCTTTAAAAATTCTGC a909_ai2.seq 

' . TCCTCGTATT AATG'CAAATC AAACCATC ATCAATACAAAAC AT A A C G C A C Majority 

2060 2070 2080 2090 2100 

2051 'TCCTGGTATT AAT--GGAAATGA. AACCATC A T C A A T A C A A A AG A TAAGG C AG cohl al2 seq 
2051 -TCCTCGTATTAATCGAAATGAAACCATCATCAATACAAAAGATAAGGCAG 3909^12. seq 

A A A G AATGGC GATTGTCAC CATTTTACGTGTATTTGTCATAAAAAAATTC Majority 

2110 2120 2130 2140 2150

A\A AGAATGGCGATT. GTCACCATTTTACGTG T-A T T T G T C A T A AAAAAATTC cohl_jal2 seq 
AAAGAATC-GC GA T-T G T C A C.C A T T T T A C GTGTATTTGTCATAA A AAA A TTG a909_.al2.seq 

C T C C A A TTT AAATAAATTGAAAGAAGCTCCAAAGGTAAGCGTAGGTACGC Majority 

2160 2170 2180 2190 2200

CTC C AATTTA AAT.AAATTGAA.AG AAGCTCC-AAAG.GTAAGCGTAGGTA CGC cohl__ai2 seq 
CTCCAATTTAAATAAATTGAAAGAAGCTCCAAAGGTAAGCGTAGGTACGC a909_jal2.seq 

GAAA AAAACC TTTCTCT TCTCC C" A T C C A GAG'TTTACTG TCGC T T G T G G A A Majority 
2210 2220 2230 2240 2250

G A A A A A A A C C TTT GTCTTCTCCCATCCAGACT.TTACTGTCGGTTGTGGAA cohl__ai2 seq 
GAAAAAAACCTTTGTCTTCT C C C AT CCAGACTTTA C T G TGGGTTGTGGAA a909_.al2.seq 

-TCTCACCACATCAGCT.TT CG.CTCGCCCACTGATGCTTCACAACTGAC.AAA Majority 
2260 2270 2280 2290 2300

TCT C A C C A C A T C A G.G f T T G G CT C G C G G A C T G A T 0 C T T C A C A A C T G A C A A A cohl.jal2.seq 
TCTCACCACATCAGCTTTC.GCTCGCGGACTGATGCTTCACAACTGACAAA a909_al2.seq 

TAACTTGGAAG CCATTACCCCCGGTCGGGA ATT A C A C C C T G C C C T G A AG A Majority* 

2310 2320 2330 2340 2350



T A AGTTGGAACCCATTACCGCCGGTCGGGAATTACACCCTGCCCTGAAG.A COtU_aI2.seq 
TAAGTTGGAAGCGATT. A C C CCCfi CTCCB C A ATTAC AC CCTCCCCTC1 AC A a909_al2.seq 

CACCT.ATAGCATAACAAAAAAAACTTGCAATTGCAAGTTTTTTAATCACT Majority 
2360 2370 2380 2390 2400

CACCTATAGCATAACAAAAAAAACTTGCAATTGCAAGTTTTTTAATCACT cohl_ai2,seq 

CACCTATAGCATAACAAAAAAAACTTGCAATTG.CAA G TTT TTT A A.TC ACT a909_ai2.seq 


AATTACTAGTAGATTGTA TAATATTAATTTTTAACATCAATTAATTGAC.A Majority 

2410 2420 2430 2440 2450

A A T T A G T A G T A G A T T G T A T A A T A T T A A T * T J T A A C A T C A A T T A A T T G A : C- A * c6lil__aA 2 . seq • 
A A T T A G T A G T A G A T T G T A T A T T A AT T fT T A. AC AT C A ATT A A T.T G A C A a909 ai2-seq 

G C G C A C-T A A T,A C T C T A C C T A C T C C .TG.CC t TT.C T A C A A G T A A A C A A G C T T A Majority 

2460 2470 2480 2490 2500

245i vG C G C A C T A A T A G T 0T A G C T A C T.C C T G C C T T T G T A C A A G T A A A C A A G C T T A cbhlviil2.seVi' 
.2451 GC G C A C T A A T A C T C T A GC TACTCCTGCCTtTGTAC A AGTA-AACAAGCTTA a9Ctt.jai2.sea. 

A G T C C C A A T G A T T G T C T C A TG T G G C A G T T T T.A T A A A C TTTTT C A A T C GC T Majority 

2510 2520 2530 2540 2550

2501 ACTCCCAATC ATT-GTCT'G ATGTGGCAGTTTTATAAACT T.T TTCAATCGCT cohl_a!2.seq 
2501 AQTCCC AATC.ATTGTCTG AT GTG G C A G T. T T T A T A A A -C. T T T T T C A A T C GCT a909_al2.se^ 

GTT'GGTTCAATA ATTTCTCTATTACTGATTTTGTAGTGATAG AT T T G C C C Majority 
2560 2570 2580 2590 2600

2551 CTTGGtTCAATAATTTCTCTATTACTGATTTTGTACTGATAGATTTGCCC cohl_.al2.seq 
2551 CTTGGTTCAATAATTTCTCTATTACTGATTTTGTAGTGATACATTTGC.ee a909^a!2.seq 
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2601 
2601 



^" C "f ' T ' G f A "fi® ^'pVty *T CC CAT ATCTACATTTTTTAAAC 

2610 2620 2630 2640 2650



TrTT?T^ C ^r^ UUTAAACATCCCTTCCCAnT ^A CATTTTTTAAAG a)hl ai2 sea 
T G T T G T A G T T G T A A A A T A A A C A T C C G T T C C C A T A T C T A C A T T T T T T A A A G SS[£ 1 2 ^ 

CATCAAAATGATAAGGAAAATTATGCCC A CAAATCACCATGTTATTAGTT Majority 






r aI r J a a a ^ATAAGGAAAATTAtGCGCACAAATCACCATGTTATTAGTToohl a!2 sea 
CATCAAAATGATAAGGAAAATTATGC GCACAAATCACCATGTTATTA G T T SS^!*^ 




2701 A A A T A A G A A C C A.T A A T A GC TT G T A G G C G t4 
2701 * * * " * " 1 ~ - 



& a a t a a r. a w» a ^ « , _ - -«-. T T A G A C A G T T G.T TCAAAACT cohl al2 sea 

AAATAAGAACCAT. AATACCTTGTAGCCGTTTTAGACAGTTGTTCAAAACT tfCMLMM 

ATAATTAGCAGCTACCGGTAAATGCAGTTT TAAGTTCCGAATATCCAGAG Majority 






A^TaTTaT^ cohI_ai2,sea 
ATAATTAGCACCTACCGGTAAATGCAGTTTTAAGTTCGGAATATCCAGAG a909_ai2.seq 

TTCCCA ACTA AT CTGTTT TATCC AACTTTTTTAC ACGTA AT TCTCC ATTT Majority 



2810 2820 2830 2840 2850



11 C C C i J GTA ATrTrTTTTiTr CAACTTTT T-T A C A G G-T AATTCTCCATTT. cohU.12.seq 
TTCCCAAGTAATC-TGTTTTATCCAACTTTT T T ACAGGTAATTCTC CATTT a909il2.seq 

T C T G A A C C C T T T A C T T G A T G C G T A A T A G A T T T A T C A A C C G C C T T C A C A A T Majority 

2860 2870 2880 2890 2900

TCTnACCCTT^C^ cohl_ai2.seo 
I* X GAACGCTTTACTTGATGCGTAAT-AGATTT. ATCAAGCGCCTTG AC A A T a909_ai2,seq 

A T G C T G A G A A G T T A A- AT GAG C TT C A T G C G C C T G A T T A A T A T T A TA.CCAAf. Majority 



2910 2920 2930 2940 2950



2901 ZA TC C.TG A C A A C TT.A A A TC A G CTT C AT G C C C C T C A T T A A T A T T A TA C c A A r „ 
2901 A TO C T G A G A A G T. T A A A T CA C C T T G A T G C G C C- T G AT T.A 'lATTATKCilc S^all^ 

CCCAATAGATTCCAGAACTTACCAGAATC.A T TCCGACTATCGCTAAAAAA Majority 



2960 2970 2980 2990 3000



2^11 X CCUt'Tg^t^^^ cohl_ai2.se q 
2951 CCCAATAGA.TTCCAGAACTTACCACAATGATTC cfl A CTATC GCTAAA AAA a909lai2.seq 

TTTGCTCAATATCTTCTAATCACGTCTTCTT CTCCATTTTAAGGCTATTA Majority 

3010 3020 3030 3040 3050



39 01 T'TTCCTCAATAT'CTTCTAATC'AC CTCTTCTTCTCC A TTTTA aK I pptatta '^m 
rmni TTTrr-rrA ATAT^-r-T-^-r-* ^ / 7 ^ x . ill A Affl (* C T A T T A cohl ai2.seq 

3001 TTT. GCTGAATATCTTCTAATGACGTC TTCTTCTCC ATTTTA AG GC T A T T A a909lal2.se^ 







T G-T C C AG T A A A C G.gHa GCTTTC C CTTTG TC T G A T G T G T T AC T G T A C C A r t ^ki » ~, 
TGT CC AGT A A AC G-GG AGCTTTCCCTtf T G X T G T^ GjJ T AC'TGT A G G ACT So9Zal2*seq 

A ATTCTCTG.TTGT TTTTTA GGTTTA.GC ATTT A AAGGGCTCATTT.TCTCA A Majority 
3160 3170 3180 3190 3200

t a H ^ I ^ T T C T T ^ ^ T T A G G. T T T A G C A T T T A A A C C G C T C A T T T T C T C A A coh L al2 sea 
A A TBI GTCTCT.TCTTTTTTAGGTTTAGCATTTA AAGGGCTC ATT T T C TC A"A. a909_ai2!seq 

ATCCTGTAATATCCTACTTGCCATCCTTAC G TATTGATAGTATAAAGGGA Majority 



3210 3220 3230 3240 3250



•A?rr?r?M?^^^ CTTCCCATCCTTACGTATTGATAGTAT AAAGGG A coh. ai2.se, 
ATGCTCTAATATCGTACTTCCCATCCTTAGGTATTCATAGTATAAAGGGA 3909^12;^ 
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3251 




CCC ^r CA ^ A !f!^ T SI TCCCAACAATAC " TCUCCGG ^AATTATCAl cohl a 12 .seq 
CCCTTGAGGAAGATTGTTCGCAACAATACCTTCAGCCGGTAAATTATCAA a909~al2.2a! 






S f^C AtCTTT AGC TTGTGCTACTTG Gt A A AC A A GTAAATT AC CGC C A GCG cohl a!2 sea 
CAACATCTTTAGCTTGTGCTACTTGGTAAACAAG T'A A A TT ACCGCCAGCG SStaiL^ 

A T A C C T T G A T T A T T A TACT T AT T.TTGT AT ACT A AT AC A A CC CCTTTTC AT Malor-irv 
3560 3570 3580 3590 3600

3551 ATACCTTGATTATTATACTTATTTTGTATAGTAATAGAAC.CCGTTTTCAT a909_al2.seq 



CTGATC ATTCG T.ATC A G'C A -C A C AC A AGTTGA GTACTTAG AC.TAAATAATA Majority 

3610 3620 3630 3640 3650



■S£ ITIITcTtTIII^ cohllaia.seo 
3601 CTGATCATTGGTATCAGCACACACAAGTT.GAG.TACTTAGACTAAATAAT A a909_al2.seq 

AGAGAA GAGTTATC TTTAGGATCTTTTTATAAATCATTCTTCTCTTCCTT Majority 

3660 3670 3680 3690 3700



occl A J A CAACACTTATCTTTACCATCTTTTTATAAATCATTCTTCTCTTCCTT cohl aI2 sea 
3651 AG AG A AG AGTT A«TTTACC ATCTTTTTAT A A. A T C A T T G T T C T C T T C C T T Sw£lLt£! 

TCTC ATTGC TTGTTXTA A A ATTT TC TT AC G .TTG ACGTGCTC.TCCT ACTT A Majority 

3710 3720 3730 3740 3750



T C T C A X • X C G T X C X T- X.T A. A A A X T- XX CilAGG TT G A C. G X G C X C X C C X A 15 $ A 

C T T C T A A A € A G A T T A A A- ACfi T/A AA A T C.A A ACTA A C G A A A A TAG C G A T- A A A T* Majority 
3760 3770 3780 3790 3800



3751 V P T.T CTAAAGAGAT t.A A A AG T A A A AT C A A A G T A A GG A A A AT A GCGATAAA T cohl a i2 ^ 
3751 CTTCTAAAHaGA-TTA-AAAGTAAA^TCAAAGTAAG.GAXa^ 

GGTGC. GAT ATAAATAGG CTCTATTTGTATTGCCTCTGCTACTACCAAAGC Majority 

3810 3820 3830 3840 3850



S $ 1^ 5 G A T A T A A A T A G G C T C T A T T T G T A T T G C C T C -T G C T A C T A C C A- A A G C cohl_al2 sen 
G G'T G C G A T A T A A A T A GGCTCTATTTGT A'T T. G C C T CTGC T A cQ ACC AAAGC a909Zai2!"seq 



CTTACCA TTA'T.CG 'TTT G G T A C A C G. AT G T C CTC t C A C TA CT A A C C G A TG G G Majority 
3860 3870 3880 3890 3900

^ QCt G " AC $ ATTATCGTTTCCTA CACGATGTCCTCTCACTAGTAACCGATGGG cohl^ai2 sea 
3851 CTTACCAT-TATCGTTTGGTACACGA TGTCCTCTCACTAGTAACCC ATGGG ^09^12^ 



3851 
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t a t t p r yurcx j |g f-=fc j^j gEj ** caaactttggtactctttacct 1 



3910 3920 3930 3940 3950

TATTAACGCCATATGGTGTACACGTCACCAAAGTTTGCTACTCTTTACCT cohl_ai2.seq 



3901 T A T T A A cfi C C A T ATC GT C T A C A C C T C A C C A A A G TTTC G T A C T C T TT A C C T aflOd"^^ 
TTAAC A ATTTCTAAATC CCTCAAATCATCCCCTTTAACTGTTCTCATTTC Majority 



3960 3970 3980 3990 4000



TTAACAATTTGTAAATCCCTCAAATCATCCGGTTTAACTGTTCTGATTTG cohl ai2 sea 
TTAACAATTTGTAAATCCCTCAAATCATCCGGTTTAACTGTTCTCATTTG aMSLtkaiseq 

AT CCA CTTGATAAGTAT ATGTTTC ATTTAA GATAGTC ACTGT CCAGTGCT Majority 
4010 4020 4030 4040 4050

ATCCACTTCATAAGTATATGTTTCATTTAAGATACTCACTGTCCAGTGGT cohl ai2 seq 
ATCCACTTGATAAGTATATGTTTCATTTAAGATACTGACTGTCCAQTGGT a909~al2.seq 

CTCCACCTTTTAACTTATC CAAATCAGAAA. A A AG CC TTC A A C A G GC T A A A Maloritv 

4060 4070 4080 4090 4100



4051 C T CCABC-TTTTAACTTATCCAA AT CAGAAAAAAGCCTTGAAGACGGTAAA cohl al2 seq 
4051 C T C C A C C T T TT A ACT T A T C C A A A T. C A.G A A A A A AG C C T t C A A C A G G C T A A A a^Ial^ 

CCTCTATGTCCTGATAAAA TAGAATG. AGTTGAGTCTCCTCCA, ATTGGAAG Maloritv 
4110 4120 4130 4140 4150

4101 XJC-TCT ATGTCCT GAT A A A AT A GA A I G A G TT C A CT CT CC TC C A A TTGC A A G cohl ai2.seq 
4101 C C T C T A T G T C C T G A T A A A A T A G A A T G A G T T G A G T C T C C T C C A A T TGG A A G a909Zal2.seq 

ACTA GT T C C T TC TAAATGACC A A TAGAAGTTTCAAG C A C TTTTTCACTTG Majority 

4160 4170 4180 4190 4200

j 151 ACTACTfCCTTCTAAATGAGCAATAGAAGtTTGAAGCACTTTTTCACTT G cohl al2 seq 
4151 ACTACTTCCTTCTAAATGACCAATAGAAGT T T C A A G C A cQt TTTCACTTG a9oCal2!seq 

■ T A C C A T G A T A A A GTG GT A A T TTTA T G T TT A TC TT TG G A ATT GAAATATAA Majority 

4210 4220 4230 4240 4250

4201. T A C C A T G A T. A A A GT G G T A ATTTTA T G TTTATCTTTGG A.ATTG A A ATA T A A cohl a 12. seq 
4201 TA CCATGATAAAGTGGTAATTTTAtBtTTATCTT.TGGAA TT G A A A T A T A A a909lai2.seq 


CCCAT ATTACCC GTTTTAT CGATA GC CAGTTGTG A A T T A T A A T: C C A A A C G Majority 
4260 4270 4280 4290 4300

4251 CCCATATTACCCGTTTTATCGATAGC-CAGTTGTGAATTATAATCCAAACG cohl a 12. seq 
4251 CCC ATATTACC CGTTTTATC GATAGCCAGTTGTGAATTATAATCCAAACG a909_ai2:seq 

OTCTTCCTTAGT CAT G-T GCCACTTCATTCCTCAAGTTTTAAATTGCTTAT Majority 
4310 4320 4330 4340 4350

4301 CTCTTG C &T * GTCATGTGCC ACTTC ATTCCTCAAGTTTTAAATTGCTTAT cohl ai2 i seq 
4301 C T C T T G G tO A G T C A T 6 T G C C A C T T C A T T C C T C A A G T T T T A A A T T G C T T A T ^909^12,5^ 

TATATTCTTTGG C.TCGGTTA AT H TTTTTTT ATAGTCCTTTTC ATCC AT A Majority 

4360 4370 4380 4390 4400

f 51 .T ATA *I T G T T,T G GC TCGGf T;A AT A A TT T TTT T A T >H'T C .G T t T.TCATC C A T A cohl__ai2.seq- 
-4351 TAT AT T C TT TIG GC.TCG-G TT ft A X A A X TT T-T T TA TAGTCG TTt.t C AT CCA t. A a909_a£2.seq 

T G C G T T A CC C G g' tCT'J. GGT A: A T.C CAT A A T C C € T C G A G ATTGGT G A A A T. G A Majority 
4410 4420 4430 4440 4450

4401 T G C G T T A C.G C G G T C T f G G t A A T C C A T A A T C G C. T C G A G A T T G G T G A A A T G A eohl_ai2.se<i 
4401 TCCGTTACGC G.G T C T-.T G.GTA ATCCATA ATC G C T C G AG ATtGGTG A- A A T G A a909_al2.seq 

ATTCCAATAA T-T AGGAAGTGA AG G ATAAGCC ATTAA- GCCTACC C C C A C T'G Majority 

4460 4470 4480 4490 4500

'4451 A T T C .C A A T A A T T A G C A A G T G A A G G A T A A G CCA T T A A G C C T A C C C C C A C T C cohl ai2 seq 
4451 ATTCC A ATA ATTAG.CA AGTG AAGG ATA AGC CATTA AGCC TA CCGCCACTG a909i.al2]se5 

C A A T T A T A G T G A C A AGCA A A A T G G ATACTAAATGTTGTCTTATTTTTTTC Maloritv 
4510 4520 4530 4540 4550 

4501 CAATT AT AGTG AC AAGCAA A ATGG AT ACT A A ATGTTCTCT.T ATTTTTTTC cohl_al2 seq 
4S01 CAATTATAGTGACAAGCAAAATGGATACTAAATGTTGTCTTATTTTTTTC a909_ai2.seq 
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4551 
4551 



4600 

JL 



ATATATTTTAAATCTCTACC ACTTTGCTACCC CATT TT ATT a a r a trrrl ... 
ATATATTTTAAATCTCTACCACTTTCCTAGCCCATCTTATTAAfiAACCTA a9TO_ai'2[seq 

Majority 






AGTACCTTTGT. TATTTT CAACA GT TCG C T'T AACTAAAACCTTATCTCA at Haj ority 

4710 4720 4730 4740 4750







CTGTTGCATTAGCTTCTGTGCCCCATTCAACGTTAT TTGTATCCTTAA a a. m<>rlty 

4910 4920 4930 4940 4950






TTTAGAAATTGACCCGTAGCATTCTTTAAAAC A AATATAGCACCTTGTAA Majority 

4960 4970 4980 4990 5000




5050 



T C A A G C T T T T G T G. C A A C C A T C A A T T T T T T T T A T A G T A A T t T r A r r a t r r n JO 
TG-AACPTTTT^ T'r- n a a #» V. * ~ V ?■« ~ - - - * • A •*• • A C C A T C C - C eohl_al2.seq 

^ • * ? C T T T T G T .G G .A A CC A'T'G A A T T-T.T T T-T TATA G T A A TTTGAC CAT 6 C C a909_ai2.s2 




,G T C <_ C A A T G T T T G T-A T.T T T C T G CT A A A T C A G C T G A A C C T GCTXT A CC TCC Majority 

5110 5120 5130 5140 5150



G TCCCAA TG T T T r t a t-t tt^J^p^^ A - AT C A G £ T G A AC C T G G T T T A GC T.C C cohl_.a_2.seq 
G T C C C A A T G T T T G T A TT T T C T C G T A A A T C A G C T G A A C C T G G T T T A G C T C C a909la!2.^ 

ACTCTTTAATACtCCTGTATAAGTGACT C TGATTGTATTTATTCCCTTAT Majority 

5160 5170 5180 5190 5200



ACTCTTTAATA-CTCCTGTATAAGT-GACTGTGATTGTATTTATTCCCTTAT _309_al2.s_J 
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Weighted residue weight table. 






lA\ A ^!!^r! T T ^ TTAGCTCCATTTTGAGTATTTCCG ^TGGACTATTicohl ai2 sea 
AAAAAAAGTCATCATTACCTCCATTTTGAGTATTTCCCGTTGCAGTATTC SSJjSI I^J 

GTAGCTGCCCACGCAATA CTAATCC TGAAATTATTATTTTCCTCTAACAC Majority 

5260 5270 5280 5290 5300



GTAGCTGC CC AC C C A A T A GT A A T C G T C A A ATT ATT ATT TT CC TC T A A f A r 

GTAGCTGCCCACGGAATAGTAATCGTGAAATTATTATTTTC C TC T A A C A G a90^~at2]seq 

GTTATACTTCCCAGTTGCTTTTTCCGAACCTTG AGTTAGACTtGTAATAT Majority 

5310 5320 5330 5340 5350



GTTATACT T.C CCA CTTGCTTTTTCCGAACCTTGAGT TAGACTTCTA A T A V ^ut ,o 
CTTATACTTCCCACTTGCTTTITCCCAACC^TStSI^SiS?^;!"!; SS^S^ 

TCCCTGATCCATCAGTAATAGTTACTTCAT AAGATCnTTCGTTCAAATCA Majority 
5360 5370 5380 5390 5400

5^} TCCCTGATCr l^ C ^ C x A ^h TTACTTCAlAAGABCC TTi;GT T^AATcl cohl al2.se, 
5351 T C C C T G A T C C AT C. AG T A A T A G T T A C T T C A T A A G A T C C T T C G T T C A A A TC A a9 09^12 sSJ 







ACTAC. AGAAGCAGATGGCATAGTATCCTTTATAACATATTr A T a *r- a r ttt u« *o 
AXTACACAAOC.ACAT.ee AT A CTATCCTT?i?M-cl?A??CA?icA"??-S!^L-S 






T T C T G T A C C A T G A T A Ta T T G A C T G C A T T CT T A T A A G T A A T A C T h t a t n \ ,o 
TTCTGTA'-CCATGATA.ATTGACTGCATTCTTATAA-G T'A ATAGTATATTTG A' a909Zai2*seq 

C-T C T A T C A C C A A C C G A G T A C. G T T T-T T T G A T C T A C A G T T T T T C C A C C A C C. A Majority 

5510 5520 5530 5540 5550







^^^^ ATGTCGCATCAGTATTCT TTTCATCAATAGTAGCATTTGGAGTcohl aI2 S ea 
TCTeCCCATGTCGCATCAGTATTCTTTTCATGAATAGTAGCATTTGGAGT SSfcjSl ^ 

TACAGATGTAACCATAAT TACAG'C TCCATTATTAACAGTGCTAGAAA. CAT Majority 



5610 5620 5630 5640 5650



I A ^ AGA J GT A A C C A T A ATH.A C A C C TC C A TT ATT A ACAG TC CTA C A A AC AT cohl ai2 sea 
TACAGATGTAACCA. TAATTA.CAGCTC'CATTATTAACAGTGCTAGAAAtAT a909Za£2]seq 

AATAAfATCCA TATTGGG AAACATTAATAACC.TCAGTACCATCATTATTT Majority 

5660 5670 5680 5690 5700

A A T A A T A TCC A T A ? T G G G A A A C A T T A A T A AC- C TC A G T A €UWAT TA TT T cohl ai2 sea 
A AT A A. T A T C CAT A T -T GGG.AAAGA T T. A A T A.A C CT C A; G f A.CC ATCA Tf A.TTT T^lzfil'^ 

G A C T C A GT A AC AC T C G A A A C T G GT C T A C T A TT A C C j G At A T A G.A T T T A G C Majority 

5710 5720 5730 5740 5750

GAC^r^rT Mr^r^ 'cohlLai2.seq 
GACTCAGTAACA G T G G A A A C T G G T C T A G T A T T A G C T G A T A T A G A T TT A G C a909lai2.s^ 

C'CATGTCCCAAT CT CATTT-CCTGA C CC A G T -A TCTTTTT TAc'tTACAT A.TC Majority 

5760 5770 5780 5790 5800

C CATGTCGGAATCTdATTT.GCTGACGCAGTATCTTTTTTA G-T TACATATP ™hf ™ 
C C A T'G T C*6"C A A'T CT-CATTTG CT G A C-G CAGTATCT T T*T T T A G*T T AC ATATG *a9X)9^3lZ^seq • 




I TfZfWlT^ c6hl_ai2.seq 
TTCTCCCTCCATTAGTAGTTGTCGTAAAAAGAGAATTAAAATCAGTTGA A a909_a!2.seq 
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5851 
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f*?g S8 7° 5880 5890 590G 

CCTTTATArTrirrTTrTT**r^ cohl_ai2 .seq 

CCTTTATACTCAGCTTCTTTACCTTGAGCAATTAAATAAGAAGCTCCATC a0O9_.al2.seq 

TTTATTCCAATC ACA TACATTTGCATTATCTATTTCTGCATCAAAAACTT Majority 
5910 5920 5930 5940 5950

11 l A * ^CGAATCAGATACATTTCCATTATCTATTTCTGCATCAAAAACTT cohl ai2_seq 
TTTATTCCAATCAGATACATTTCCATTATCTATTTCTG'CATCAAAAACTT a909_al2.seq 

TGTATGCTTTATAGCTTC C GCCTTTTTGAGTATCTTGAACTGTAATTGTC Majority 

5960 5970 5980 5990 6000

If] ATGCTTTATAGGTTGCGCCTTTTTGACTATCTTGAACTGTAATTGTC cohl ai2.seq 

595 1 TGTATGCTTTATAGGTTGCGCCTTTTTGAGTATCTTGAACTGTAATTGTC a909lal2.s^ 

CCTGTCTCAG.CGGCAAAAG C TATCGCCCTAACT G G TCATACAGCCATA C C Maloritv 
6010 6020 6030 6040 6050

5?IS TCTCUCG&CAAAAGCTATCGCC GTAACTGGtGATACAGCCATACC cohl__ai2 seq 
C C T G T C TC A G C G G C A A A A G C T A T C G G C G T A A C T C G T G A T A C A G C C A T A C C Sooii^ 

AAATGCTAAACTCGCCACTAACAGC CATTOAATCATTTTCTTTTTCATTG Majority 
6060 6070 6080 6090 6100

AAATCCTAAACTCGCCACTAACACCGATTGAATCATTTTCTTTTTCATTG cohLai2 seq 
AAATGCTAAACTCGCCACTAACA. GCGA T.T.G AATC ATTTTCTTTTTCATTG- a909^al2!seq 

AAATCTTTCTCCTAAAATC ATATTGATGAATGATTAATTCATATTTTTTT Majority 
6110 6120 6130 6140 6150

A A A T CTTTCTC CTAAA ATCATATTG ATGA ATGATTA ATtC AT A T T T T t T T cohl_ai2 seq 
AA AT CTTTCT.CCTAAA ATC.ATATTGA. TGAATGATTA A TTGA T ATT T T T T T a909_al2!seq 

T C G A T A G T A T A A T ATT AATC C T G A T G G T A G A G C T A A A G C T A A A C C A A C T A Majority 

6160 6170 6180 6190 6200

*l Sl TCCA T A G T AT A A. T AT t AATC CTGATGGTAGAGCTAAAGCTAAAC C A A C T A cohl ai2 seq 
6151 TCGATAG TfA T A ATA T T A A TC CTG AT.GGTA G AGC T A A A GCT AA ACCAA.CT A ^0^Zai2*seq 

GG ATATA A ATGT;GTCTTC CA ATACCTCCA'CT ACT ACG'C AATTC TCTTCCT Majority 

6210 6220 6230 6240 6250



^^T^f AATGTCTGTTCCAATACCTCCAGTA ^ cohl" ai2.seq 

G G A TA TvA A A'T GTGTGTTCCAATACCTCCAGTACTAGGCAATTCTGTTCCT a909lai2.seq 

TTACTGTTAGTAATTTTAA A AGTATATACTGTACTTCCATC T.AX T A A ATT Majority 
6260 6270 6280 6290 6300 

TTACTGTTAGTAAtfTTAAAAGTATATACTGTACTTCCAT cBa CTAAATT cohl ai2.seq 
TTACTGTTAGTAATTTTAA A- A G T A T A T A C T G T A C T T C C AT C T A CTAAATT a909lai2.seq 

C T C T T T T'A TTGG TGTCGCAT TATTACCATTTTGTTCA A AGGTAACTC CC G Majority. 

6310 6320 6330 6340 6350

SI C TC T T TT A TT G;GT G T C GCATTAT.T A CC.AT t T T -G T T C A AAGG TAACTCCCG cohT__ai2.seq 
6301 CT C T T TT AT T G.-G-T G T C G C A T T A T T A C C A T T T TJT'tT'C A A A G G f A ACT CC C € a909ii2.seq 

T AG-AAA TC, AC T A A TACT.GA T AT A TIC A f TT T T AG C. J. ACTA GO TA C C..CT CCA Majority 
6360 6370 6380 6390 6400

T AC A A A TC ACTAATAC'TG ATATATC ATTTTTAGCTAGTAGGTACCCTGGA cohl ai2.seq 
T A G AA.AT.C ACT A A T A C T G AQ AT A T C A T T T T T A G G T AGTA.GGTACCjCT.GG A a909„ai2.seq 

GGG GCCTTT.GfCTCTGTTAGGTAG T AT T T T CCTA'CTGGC U AC.TG A GGT A Majority • 

6410 6420 6430 6440 6450

££1: G £ C ? G T C TC T G T T A GCT AGT ATT T'TCCT ACTGGCA AACTG AGGTA cohl al2.seq 
GGGGCCTTTG.TC T C .T C T*T AGGTAGTA T..T TTCCTAC T G CCA AACTG AGGTA a9093al2.seq 

G T T A TT A.C C A T C C. A C T A A T A A C A A G C C T TT A T C G T T T G T C A C C A GC CCTG Majority 
6460 6470 6480 6490 6500

GTTATTAGCATCCACTAATAACAAGCCTTTATCGTTTGTCACCAGCCCTG cohl ai2-seq 
G T-T ATTAGCATCCACTAATAACAACCCTTTATCGTTTGTCACCAGCCCTG a909lai2.seq 



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6551 



A A T A 
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ATTAG CATCTGATTCATAAATA Majority 



654G 

JL 



6550 

JL 



650I AATACATA G GATGTG-A A TPTTta t t n r r * ^ ~r * ^ ~ * 1 — : — 1 

-^cataggat^g^gcTttattccc^ 

TCAAAAACTCCACCTGCTAAAAAATTATTATCAT T TTCGACATTAACTTT Majority 

6560 6570 6580 6590 6600






TCAAAAACTGCACCTGCTAAAAAATTATTA TCATTTTCCACATTAArTTT u, ,, 
TCAAAAACTGCACCTG C-T AAAAAATTATTATCATTTTCGACATTAACTTT a909~al2!seq 

CTGTAGTCGTA-CTTTTTG CTTG ATAC G TGTATTGGTAAAG'CTAATA TCT * Majority 



6650 



CTGTACTCG T A CT TT TTGCTTGATACGT GTA TT GCTAAACCT A A'T A t r t I ,o 
CTCTACTCCTHCTTITTCCTTCATACCTGT A.T TGGTAAA.GCTAATATCTA a909^^l2^seq 

CCTCTCCTCAAACTGTC ACCCA T TG TAAGCCCCTACCATCATAACTTTTA. Majority 

6660 6670 6680 6690 6700







TCACCTTCACCAGTTCCT'ACATTTTT T TCTG.TAATTCAC T C AGATAOTTT mocity . 

6710 6720 6730 6740 6750

T^rr^rr*^*^?^^ <*>hl ai2 sea 

TCAGCTTCACCAGTTGCTAGATTTTTTTCTGTAATTGACTCAGATACTTT SooIaiZ^ 

AA A T T C A T C G T A G C C T T G T T C A TCTA T TC ATAT AC A ACT T C CATAAGGTA" Majo^ 

6760 6770 6780 6790 6800







C T T T A A AT T 6 C T T A G T C T C A C C A T C T C T C A G n G G A A A A T T C T C T T G TT G C. Majority 

6810 6820 6830 6840 6850

s{}-{:;.:??^j.f{SJs?!jsssj?Ts;s{{jtsj{nf.i?i?j?ijjsf aas; 

A A CG T T T C A C T T'G G A T T A A A C A A G A A G T C T T T C G T C T T A T P. T T C A T C t A C Majority 

6860 6870 6880 6890 6900







TCCAACGACAGTTTTACTTACTCTGACGGTGTA TTCTTTAGGTTGCCAAA Majority 

6910 6920 6930 6940 6950



T.CCAACGAC'AGTTTTACTTACTCTGACCG TCTATTCTTTArfTTrrrt a V 
TCCAACGACAGTTTTACTTACTCTGACGGT T GT:T T ^t T n:^^^L\ A : 

C AGCATAT A ACGTATTTGTTGC ATGAGGGTTGT T ATC AATACCTATTCAT Majority 

6960 6970 6980 6990 7000



C A G C A TAT A A GGTATTTGT.T G C A T C A G G G T T G T T A T C A A i A r r t l t -r r> * I 
C AG C-A T A T A A G G TA-T TTGTTGC A t C'A gJ'.Lt T G .TT1TC A AT. A C C.T''X* ? T- G A T So9Zaiz!se^ 

T G A C € T G CTGT A. A ATT CC AC ACGTCCTGT AT C A G C T A A AT'C'C f T A T * A x £ Majority 



7010 7020 7030 7040 7050



T C A C G T C C T G T A A A T t C C AC A'CCTC CT GT A TC A CC T A A A T C CT T A T r A x r m U 
TGACCTGCTGTAAATTCCACAGCTCCTGTATCAGCTAAATCCTTATCATG a909^ l2 .s2 

ATGCCAACCAAtAAfiCTTCTAACCTGTCGTTGT A AACTATTGC^TTTC AG Majority 

7060 7070 7080 7090 7100

. C A A C C A A T A A G 0 T T G T A A C C T G T C C T T G T A A A C T A T T G G T T T X C A C aOOg^iiZ.seq 




ATTGTAGTTGTGCTATTCAACTCCATACGCGGTGTCT ctu I cttctttt , , 
GAATTCTAGTTrTrrT»TTfi«rT/-/-.i7„„ ACTTGTGTT cohl_al2.seq 

i» » A « in « TGTGCTATTCAACT-CCATACGCGGTGT. CTCTACTTGTGTT a909 a!2 
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7151 



TGAGGTATCTTCTAATTTGATATCTCCTAC T GCAATAATCACAGGTTTTA Majority 

7210 7220 7230 7240 7250 



Tclcc?^?-C???? W^^t^ TCTCCT ACTGGAAT AATGAC AGGTTTTA cohi_.al2.seq 
TGACCTATCTTCTAATTTCATATCTCCTACTGGAATAATCACAGGTTT T A a909lai2.seq 






T G G T gHaIIItHZI a r^^i^ G€TAA A T 6 G G ^ G T C A A T A TC A A T G G A A coh,_ai2.seq 
TGGTGATATTTTTATTAGCATCTGCTAAATG-GGCGTeAATATCAATGGAA aS^Ia^seq 

TCATATGGGTTATAAATTTTACCATTCTACC A CCAGCCACGG.AAACGATA UaJoMty 

7310 7320 7330 7340 7350 



T C A T A T G* GGTtIt A J Itttti rS A ^ TGT AG ^ cohl.ai2.seq 

TCATATGGGTTATAAATTTTACCATTGTACCACCAGCCACGCAAACGATA a^IaiZ.seq 



7351 
7351 




GCP Alrlrrr A 1^ C G G A C G T C T C A G ^ A A C G C T G A- A T G G C A G C C A T C G t cohl ai2.seq 
CCCA.TCTGCCATTGTC.GGACGTCTCAGTAACGCTGAATGGGACCCATCGT 3909^12^ 




7401 
7401 



C t I i r r a * £ r ? H TZ 1 k G * A ° G ° T ° A T T A C T T CT T T C T A A T T C.C T-C A C C A cohl ai2 seq 
CATAGGAAT C A GG GT C AGTAG CC T C A IT AC T.T GTTTGT A A TTG CT G A C C A a^ai^ 



7451 
7451 




G A AG CATC C A A T crlrr ZZZZZ * ^ ° T&UCCAACA G CAT CATTGCTGtI coh^alS.seq - 
GAAGCATCCA AT GCT6GCTTt.CC AT C-T G T A C C.A ACAGCATCATTG CTGTA a909lai2.seq 



7501 
7501 




£ A ^ A £ A £r A r AA ^^ A ^^ al2.seq 
TATAATA T A A T C T.C C A G C C.T T T C G C C.A AATAG.CTCTTAA ATT GAT AT a909lai2^ 



7551 
7551 




C??cicjri? A rr A ^ ^ A A A A G T T A T A A G G ^ C T A A T A C T A C C AT C T G G i cohl_ai2.seq 
CTTGAGTTACACCACCTGiAAAGTTATAAGGTCTAATACTACCATCTGGA a909_ai2.seq 



7601 
7601 




TTAAC 
T T A A e 



cirA^T t S'S ^ilityil^l A ^^ CGtCTtrT ^ CGTACTT ^ A eohl^ai2.seq 

? AT A A T A C C A -A C C G A C T A A T T T G.T A A G .C G T^TTT'T A C GTA C f T.G'T A" a909^_ai : 2 .seq 



G T|T A G T--T..G' T T : G T A -T C A A C A. TTT G' ; A G A G A C T -A G' T A T C T G T.C G T* T A A T A^ Ifat^ri^ 

7660 7670 7680 7690 7700



£ I f* G ' T T G T T A T G A A G ATT T G A G A G A p T A G T A T.C t G TCG T A t A A T AG G* cohl al2 sea 
C T T*A G T T G T T G T A T C A A C A TT T-G A G A G A C T ACT ATCTGTCGT AT A AT AG.G a^Ia^seq . 

; • A T G T 3ui*; G X T G A-G T C GGGATCTTT A T C T C G T G A A T'C A T-A C TTA T, A AT A A Majority. 
7710 7720 7730 7740 7750

C il C T J film Ftl * 1**11*111 A Z* li 5 U M 55 « A G J T A T A A t;A A ^.ai^seq . 






C A.T CTTTAGTTCX 6"T C QCW T C T XT A T C T'C G t.G A A T CA T ACTTATA-A T A A S^Z^ 

* * * • 

ATGJACCTGAACC.ATCTTGCATATAA T CCCTTGTAATATCTGTATAATC Majority ' 
7760 7770 7780 7790 7800

TTiy^Ty^fHTFT^^ cohl^.seq 
TATGTACCTGAAGGATCTTGGATATAATCCCTTGTAATATCTGTATAATC a909_ai2.seq 
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78 lO 



7820 
J. 



AATACG TATCATCTCTTTTTC Uajority 



7830 
_l 



7840 



7850 

_L 



7801 C G GAATACGATCACCATAATrrA aB t r t a 1 * *r * „ ' ■ 1 

7801 CGGAATACGATCACCA T A A T G C A Acl PTAaav^^ cohljUZ.seq 

^ACCATAATGCAAGTCTAAATAGGTATCATCTGTTTTTG a909_ai2.seq 



7860 7870 7880 7890 7900



»bfc»ilAATATTCACACUTATCTTACCTTTTCC a909_ai2.seq 




AAATACTTGTCTTTCTGCT TCCTC T TT ATACCATTTACCATCCCAAArAT toJorlt:y 

7960 7970 7980 7990 8000



SS! ACC .^0CmACT ACCTTTAGGTTG A A cflit TTGTC G T A T C G C G GCT A T A 1 

sooi accctgctccactacct™^ 



GAGGA.CAAATTTT GCTCATATAGAAC ATCCTTTA C TGC AAAATTACGAAC Majority 

8060 8070 8080 8090 8100



8051 GAG G ACAAATTTTGCTCA T A TAGAACA Tf* P ttta r t r* r • a 1 . a ^ ^ ' ■„ „ „ — 

8051 CAGGACAAATTTTf r t jT ATAi^?t^.iS C I TTACTGGAAAATTAGGAA G cohl_ai2.seq 

u A U C A C A A A T J T T G C TG A T A T A G A A C A T.C C t T T A C T G G A A A A T T A G G A A G a909_al2.seq 

. CT " C ^A^ArCAACCCC AUTA AA TA TTTAATOTTGTATCAATTAr.r.TT ^ oMy 



8110 8120 8130 8140 8150



8101 C,T C-TG TATTATCAAG.CGGATCTAAATATTf AATCTTfTATrnTT.rrTT u, ■ _ 
8101 CTCTGTATTATf A A c r r r A t r -r / V V i V i i i. . 1 T G T A T.G A A T T A C G T T cohl_ai2.seq 

u * « T A T T A T C A A G C G G A T C T A A A-T A T T T A A T C T T G T A T € A A T T A C G T T a909_.ai2.seq 



CAT.ACCATACCACTAAGTTCAAATAATCTT T GTGGTAGTCTCCATATTTA Majority 

8160 8170 8180 8190 8200

8151 CA.TAC-CATACCACTAAG T T C A A A T A A T C T T T G T G G T A G CCATA tHt A cohl_ai2.seq
8151 CA-TACCATACC ACT A AG tB|c AAATAATCTTTGT gQt aBt [T C C A T A DtD A a909_ai2.seq



TCGTAGTATTCATCT GCGATT GGC ACTTTTCTTTTTGC ACTCGTTTGTCT Majority 



8210 8220 8230 8240 8250



V^ 1 T C II * At; i A 1 1 C A T 0 JT CEB A T_1 G GQ-A C T T T TfcH TfWl T G C A C T r r T T T ct % r t h , ,o 
820, T C G T Aft A T T p A T cQ G C G AjSfr G G C A cHE T T^ifr^B t C A C ffl C TT SS^^ 

tCCGTTCTCAtC A A A T A C G T A A T T A T C T G G A T A T. A A G C T T T G A T A r. T g t x Majority

8260 8270 8280 8290 8300




T A A CJA T T.A C T-A^G G.f A -T T T TTCTG T.A A A C G T A A.A T T C G T C T C € T C C A Majority 



8320 8340 8350




8351. G C A C €.T.C jC_ 
8351 GC A C CTCCCC C T 



C T r T rn£ a t aW a^t a™ 3 T G C C A T C >» ABHC C T T.G T T t cohl^aiZ.seq 
G T C T GOT A A ADA G T A T TTG C C A T -C T A G T C C T T G T T T a909_ai2.seq 






G T AG AA CGGATAATTTTGAATT CTCT TCCr. T TTTCGATAGACTTTTATTT Majority 

8410 8420 8430 8440 8450



gBa g a a cog a taaJJ 
Htag a aBgHa t a a T 



ATAAHttTG A Afl TE|TJHT_TC, C C TTTT CG aH.A C A G T T tW A tI^T cohl a!2 seo 
ATAATTTTGAAT T Ctafc Tfflc CHOEHG G A T ABlAljT T HATH a909_ai2.seq

S BS^lBMlB £g gSeai ff g G I T A g T A T G A A C T C A C C C A A A Majority

8460 8470 8480 8490 8500

c a txtcBa T|gBrg|aTrTTMBBBM l a a lMI i 



8451 



845 C A tHHIg G A tT?F ^WB ^ G £ A ™ A T G A A C T C A C C C A A A cohOU2.seq 

8451 C A T|MG G A T T T C C tQB aQ G T A C T C C A||TG G g|J A G T A tQ:A ACTCACCCAAA a909_ai2.seq 

TAACTC ATTCCTTATCTTCCACTTTGGT TATTTCC AACATTGGTTAGGTA Majority 



8510 8530 8540 8550



Sol T A A C T C A TgC C ^L AI^GT.T C C AEfll T G G T T A T T T C C A A-C A 1 T C gBt aIBt 1' cohLal 2 .se q 
8501 TAACTCATTCC T0A tBHt C C A G 1 tKIBBHMHHBB GTTAGGTA a909_al2.seq 

ACC-CCATGCAGCTGTCTTC C-ATTCATACCCATTCGCG GCTA AGGTTCTAC Majority 

8570 8580 8590 8600



f^? A C C AT G C AC CQ GTCTTC CATT G A T A fl C C A T xfl G C G G d& A A G G T T G T A C cohl a 12 ^eo 
8536 ACGCCABgCACCTGTCTTCCABtGATAGCCATKG cQg C T A aQgiKg TAG So^Iai^ 

CGTATAGTCCTGTGTAGGTT TCGCCATCTGATGCTCTAGTTCTAGGAAtC Majority 

8610 8630 8640 8650

till cmrt^l g r c cHr tHt^S I«I g c a ^ g a " g C t cfl aBt t»taB g a a t* cohi_ai2.seq 

8586 CBTATAG.TCCBG TflT AGGTTTCGGCA tBt.G ATGCBCTAG TBfcHA G G A A HC *909_al2.seq 

CTACTATTTTCCTAATGAATC TCCGAGTAGCCCTTTTTTGCAAATTTTAT Majority
8660 8670 8680 8690 8700 

8644 G i£ AE l£ A JL Tn ^ G cHa aBg a ajJHt c c~gBHB|l cfl iB r t tt g c a a a t t t t a t «,hi ^ 

.8636 cHa OHaBt T T offr A ATB A AQC T cHg^S^KC €{^8^11 T Tflc C A A A Tfl tH AT So9^^ 

TCTGAtCAGTTTTCTATCATAAf A AA CATTAACCACACTTCAACCATCGT Majority 

8710 8720 8730 8740 8750

8^86 WG^A^r??n^x A J^ A J A AT AAACATTAAgHaCACt1gAACCATCC : ; cohl^i2.seq 
8686 E G THaUJA G T TflT CTAT CAT A A T A A A C A T T A A C G A C A C T T G A A C C A T C G T. a909_.al2.seq 

G T T TT A T,C A T G A C A C A A G T T I C T C T C C T C G T A T T A T T A A C T T T A A A C C C A Majority 
8760 8770 8780 8790 8800

£ II I 1 A "IS A I S A - A C A A G T t T C ? G T C C TCGTATTATTAACTTTAAAGCCA cohi a!2.Seq 
CTTTTATCA.TGACAGAAGTTT C.T G TC C T COT ATTATTAACTTTAAAGCCA a909Zai2.s^ 

CTCC GTAGTTTTTCATTAA TATCTTGTTGTGTTAGC CTCTG ATTAG ATAA Majority

8810 8820 8830 8840 8850

! 789 G T C,G G T A G T T TflTCAT T A A T A TC TT-G T TGTG TTAGCGTCTGATTAGATAA cohl a!2 sea 
8786 G T CGGB-AHtT TtBc ATTAATATCTTGTTG.TGTTAGCGTCTGATTAGATAA S^i^ 

ACATAGGCCTGATCGTGT TACTTG CCC TGCGT.ACTCATATGTCTTTTCCC Majority
8860 8870 8880 8890 8900

A G A T A G G C C T G A TC G T C T T A C T T G C C.C T G C G T A C T C A T A T G T C T T TTCCG a909lal2.Sq 

CATCAGfAGCATTTTTAT T A TCC GTTGCTCATTGTTCCC AGT ACTTTATC Majority 

8910 8920 8930 8940 8950

$ A I T A C C A T T T T T A T:T AT C€ G'T T GC ft ATT Q TT:G ; GC.A GT A G ITTATC £ohl al2 seq 

C A T C A GT.A G C A T TTT TAT t A : T C C ;G T T „G G T G A T T G Tt G G C-AG T A G TT t AT 6 SSlai^sS 

,G T G T A CjG T T G T T t G T G CCG G C.GACC A AT GT G C AT AT A GC ^ T C GT AT C C T f Majority 

8960 8970 8980 8990 9000

$3 S ^ A C G T TO T T T CJ T G C G G G C G A CCA A X G T G C A X A X A G C G T C e T A T C G TT cohX^ai2,^ 
G T G T A G G T T G.T T T G T G C G G C C G A C C A A T G T G C A T A T A G C G T CGTATCCTT a909_ai2.seq 

G G T C A A G A C T T G A T T A A A A T C A A A 0 G C T G T C C C A C.C A C T A G C A C C T G T G T Majority
9010 9020 9030 9040 9050

S S I S A t S A C T T G A T T A A A A ^ C A A A G C C T-cic CC ACCACTAGC AGCTGTGt cohl_ai2.se<i 
GGTCAAGACTTGATTAAAATCAAAGGCTGTCCCACCACTAGCAGCT.GTGT a909„al2.seq 

A C C A C C C T G C A A A ACT ATAACCTG C CCTTCTTGG ATCATTAG.GCTTAATT Majority 
9060 9070 9080 9090 9100

9039 A.CCACCCTGCAAAAGTATAACCTGGCCTHcTTGGATCATTAGGGTTAAtI cohi.ai2.seq 
9036 ACCACCCTGCAAAAGTATAACCTGGCCTTGTTGGATCATTAGCCTTAATT a909la!2.seq 



GTCC A A tf ^jg mS£M eyP.* A CGAGCTGCAA T ATAGGTAAC Majority

9110 9120 9130 9140 9150



GTCGAAGC AG G T T G Gflc TGTT A A C A C A C C A CG AG G TGC A A T A T AG CT A A C cohl ai2 seq 
G T C G A A G.C AG GTTGGTCTGTTAACACACGACGAGGTG C-A ATATAGGTAAC a909_ai2!seq 

TCCTGTTCATAAGTCCCC TCTCTTCAATTCAACACCCTAG.GATTCTTTA'A Majority 
9160 9170 9180 9190 9200

TCCTGTTGATAAGTCGCCTGTGTTGAATTCAACACCGTACGATTCTTTAA cohl_al2.seq 
TCCTGTTGATAAGTCGCC. TGTGTTGAATTCAA-CACCGTACGATTCTTTAA a909_ai2.seq 

ACACAGGATAA ACACT T.ATCT C T G A AG AT A C ACCT A A T&CTTC A A TT TC T Majority 

9210 9220 9230 9240 9250 




AGACAGCATAA.AGACTTATGTCTGAAGATACAGGTAATGCTTGAATTTCT cohl_ai2 seq 
AGACAGGATAAAGACTTAT G-T.C-T G A A GATAC AGGTA ATGCTTC AATTTCT a909_ai2.seq 

GATTCAGAAAG TGGAGCTCCATTTTCAGTTTTAGACCAACCTACAAATAA Majority

9260 9270 9280 9290 9300

GATTCAGAAAGTGGAGCTCCATTTTGAGTTTTAGACeAACCTACAAATA A cohi_al2 seq 
GATTCAGAAAGTG^AGCTCCATTTTGAGTTTTAGACCAACCTACAAATAA a909_al2;seq 

TAATGTTGAAG TGGGTGCTGAAACTTTAAATCAACTTAGATCCGCG G TTC Majority
9310 9320 9330 9340 9350

9289 B A A T G T T G A A G T G G.G TG G T C A A AC TTT A A A T G A A C T T A||AT€ CGCGG T T C cohl_al2 seq 
9286 TAATGTTGAAGTGGGTGGTGAAACTTTAAATGAACTTAGATCC GG..GG TTC a9C9_al2.seq 

CCTTTCGTCA A CTA TC C A A T G G T A CTTGATAACGCTCC A AAATACTTTTA Majority 
9360 9370 9380 9390 9400

9339 CGTTTCC T G A.A CTATCCA AT.GG TACTT.GATAACGCTCC A AAATACTTTTA cohl ai2 seq 
9336 C G T-T.T CCTGAACTATCC A .AT GGTAC.TTCATAACGCTCC A AAATACTTTTA a909_ai2.seq 

CCAGCAGAA T C A tC AT ATA U CTT ACTG TCC C C A CTTT ACTC AT A A A T G G Majority 
9410 9420 9430 9440 9450

C C A G C A G A A TC AT C AT A T A A ACT T AC T/G T C G C C A C T T T A C T C A T A A A T G G oohl_ai2 seq 
CCAGCAGAA TCATCATATAA A GTTACTGTCGCCACTTTACTCATAAATGG a909_ai2.seq 

A C C A AC A T A A A T TTC T TTT.CT C tC AGtT AC.AGTT ATTGCCTC ACC A A ATT Majority 
9460 9470 9480 9490 9500

A C G A A C AD A AATTTCTTTTGTCTCAGTTACAGTTATTGGC T C A CC A A ATT cohl_ai2 seq 
ACGAArCATAAATTTCTTTTGTCTCAGTTACAGTTATTGGCTCACCAAATT a909_ai2.seq 

TAACAGGGTC ACCATACTTTC.CAGTAGTAGGATCATAGGTATACCAACCA Majority 
9510 9520 9530 9540 9550

TAACAGGGTCACCATACTTTCCAGTAGTAGGATCATAGGTATACCAACCA cohl_ai2.seq 
T A A C A G G G T C A C C A T A C T T T C C A G T A G T A G G A T C A T A G G T A T A C C A A C C A a909_a!2.seq 

T T A A A ATCCTCTCCTGC TT T A ATC GTCG C A ATC C C A ACT TCTCCf AC AG A Majority 
9560 9570 9580 9590 9600

9539 TTAVA A ATGCTCT C C T G C T .T T A A T. C-G T C G G A A t C C C A- A CTT C T C G T A G A G A txjhl>12 ;sfeq 
9536 T T A A A ATGC.T/C T C C ; T'G C T * T A -A- -T f 6,T C G G A AT C C C A A CTT Ct CCT AG A G A aQOS^al 2 .seq 

T T -CT r C C A TiC TT T.TATA A T T T G A T G A t G, A A C T T G C-A T A C :C T G A A GGTCTG A Majority

9610 9620 9630 9640 9650



T T C T C C A TCt TTTA t A A T T T 6 ATCATG A ACT'TG C A T AC C T G AACC.TGTGA coht_ j ai2.seq 
TT C T C C A T-C-T T T T AT A A' T T T G A T'G : ATG A ACT T G C A f A C C T G A A C C T G T C A a909_jai2.seq 

C G A A A T T A T A A T C A G TTC CGTC' A T'TAT'TTTGAA A ATGGTA'A CTTA ACCT A Majority 
9660 9670 9680 9690 9700

G G AAA TflAT A AtC A G T T C C G T C A T T A T TT T G A A A A T G G T A A G T, T.A'A CC T A cohi_al2.seq . 
G G A A A T T A T A A T C A G T f C C G T C A T T A T T T T G A A A A T G G T A A G T T A A C C T A a909_ai2.seq 

GCAAC TTCT G. TA T'T.A TCCTCTTC A ACAATTCCAT A AATCGACA ATGA ATC Majority
9710 9720 9730 9740 9750

GGAACTTCTGTATTATCCTCTTGAACAATTGCATAAATGCAGAATCAATC cohl_ai2.seq 
GGAACTTCTGTATTATCCTCTTGAACAATTGCATAAATGGAGAATGAATC a909_ai2.seq

T C T T fgfl AgAg dOT fcg fate A'cgc.'fe gggj TTCTTACTTTCTCCACTATCTT Majority

9760 9770 9780 9790 9800

9739 TCTTTTAAAAGCAACATCACTGCTAGTGTTCTTAGTT T'C TGCAGTATCTT cohl aiZ.seq 
9736 TGTTTTAAAAGCAACATCACTGCTAGTGTTCTTAGTTTCTGCAGTATCTT a909lal2.seq 



T A C ATTTT A A T AC TTCTC TTTC A C C HTC A TCTTT A A A GTG A,AC A ACTTT A Majority
9810 9820 9830 9840 9850

9789 T AG ATTTTAATACtTCTGTTTG ACCATC ATCTTTAA AGTC AAC AACTTTA cohl_al2 seq 
9786 TAGATTTTAATACTTCTGTTTGACCATCATCTTTAAAGTGAACAACTTTA a909_ai2.seq 

AGCTTT TCATCTGAAGCTTCTAATCCCTT.ATCAtAGTTGAC CT CTACTTT Majority 

9860 9870 9880 9890 9900 



S«39 AGGTTTTCATCTGAAGCTTCTAATGGCTTATCATAGT TQA CCTCTACTTT cohl_ai2 seq 
9836 AGGTTTT GAT CTGAAGCTTCTAATGCCTTATCATAGTTGACCTCTACT T T a909_al2.seq 

TACTGGGGC.TTG GCGTTCTGCTTCTTTACCATTTGAC T C A.ATACTA ATGT Majority
9910 9920 9930 9940 9950

9889 -TACTGGBgCTTGGGGTTCTGCTTCTTTACCATTTGACTCAATAGTAATGT cohl a!2.seq 
9886 T A C T G G G G.C TTGGG.GTTCTC CTTCTTTACC ATTTCACTC AATAGTAATGT a909~ai2,seq 

CATACAGTTTGAAGTTTT T G'A TTTCACTATCTTGTTTACCAACTTCTGTC Majority 
9960 9970 9980 9990 10000

9939 CATAGAGTTTGA AH* TT-TTG. ATTTCA. CT ATCTTGTTTAGC AA cQ T C T G T C cohl.lal2.5eq 
9936 C A T A G A G T T T G A A G .T T T T T G A TTTC ACTATCTTG TTTA.fiC AA.CTTCTG T C a909_al2.seq 

AATGCTTTTTTC TTATAGTCTTT AAA .A GTACCTGA ATTCTCTTtTA ATTC Majority 

10010 10020 10030 10040 10050

9989 A A -T G CT T"T TTTCTTAT AH TCTTTAAAAGTAGCTGAATTGTCTTTTAATTC cohl_ai2.seq 

9986 A A T G C T'T T T T T C T T A T A G T C T T T A. A A A G T A G C T G A A T T G T C.T 1 T T A A T T C a909_ai2.seq 

C C'T CA CC TTTAAATCAGCATTTTTAGGAATCTTAGCT.TCTT.TCGTCAAAG Majority
10060 10070 10080 10090 10100

10039-C GTC A'CCTTT AA ATC AG-C AT'TT.TTAGG A AT C'TT AG CTTCTTTGGTCAAAC cohl_al2.seq 
10036 C.G TCACCTTTAAATCAGC ATTTT'TAGCA ATCTIAG.CTTCTTTGGTCAA AG a909_ai2.seq 

T C A C T G T T A C A G T • A T A G T C T G C A C C T.C T A A. A C A T C A A T C G T T C T T C A C G C Majority
10110 10120 10130 10140 10150

10089 T C A C t G T T A C A G T A T A G T C T G C A C C T C T A A A C A T C,A A T G G T T C T T C A C G^l cohl_ai2 seq 
10086 TCACT'GTTACAGTATAGTCTGCACCTCTAAACATCAATCGTTCTTCACGG a909_al2.seq 

TAACC AGCTTCCTCAGAAGATGATGTTTCTGTTACACTAGA AG C A G G A G T Majority 
10160 10170 10180 10190 10200

/ 10139 TAAGCAGCTTC.CTCAGAAGATGATGTTTCtGTTA'CA CT AGAAGCAGGAGT cohl__ai2.seq 
10136 TAACCAGCTTCCTCAGAAG AT GATGTTTCTGTTAC AC TAG AAGC AGG ACT a909_ai2.seq 

C T C T G G C T TG C T C T G C T C A A C,A C T T G A TT.G A G A-A CTACATG TTG.ATG A AG Majority 

10210 10220 10230 10240 10250

i?* 89 C X COG G C T T G X TCT GX T C A A C A C TT G A T T G A G A A C T A G A T G T t.G A T G A A G €ohl^al2.seq 
i0l86 CT GTC 6CTTG.CTC T G .CTC A; A CACTTG AT t-G AG A AC t AG A T.G t TGAT.G A AG a909^ai2 .ieq 

T T A € C.T G C-CT AG A A TT T.TT A TT TT C-T A A AG T AA TCCC CA.iE A T C A T C T G T 6 Majority 

10260 10270 10280 10290 10300

10239 TTACCTGG CT AGAATTTTTATT TT C T AAA C-T A A T C C*C CACATCATCT GTC .cohl^al2.se<i 
10236 TTACCTCGCTAGAATTTTT.ATTTTCTAAAGTAATCCeCACATCATCTGTC a909^.ai2.seq 

Tt'AGTT TCT.TCA AC T G TT ATTGCTGGTA.G A. A T T A A AA AA T A A G T C G T T A A Majority 

10310 10320 10330 10340 10350

' ' 10289 TT ACT TTCTTC A A C T.G T T A;T T G C T G G T A G A ATT A A A A A AT A AGTCGTTA A cohLai2.seq . 

10286 T T A G T T T C T T C A A CT G T T A T T G C T G G T A G A A T T A A A A A A T A A G T C G T T A A a909_ai-2.se« r 
A A A A C=T T C.TT A G C A T C A T C A A T G A C C A C A T G A T A A T TTT C C A CT C T.TT A C Majority

10360 10370 10380 10390 10400

-10339 AAAAGTTGTTAGGATCATCAATGACCACATGATAATTTTCCACTCTTTAG cohl_al2.seq 
10336 AAAAGTTGTTAGGATCATCAATGACCACATGATAATTTTCCACTCTTTAG a909_ai'2.seq 



FIGURE 210 




A T G & -E? O'fe T A T A A A A G T T T G A C AC T T C T T T Majority
10420 10430 10440 10450

10389 G G T C TTTTTTCTTTTTTAATGATT CG ATTA TAAAA GTTTGACA FTt7Tt*t 10 
10386 GGTGTTTTTTCTTTTTT a a t r a v -r ^ ^*******-AAAGTTTGACACTTCTTT cohl_a_2.seq 

^TTTTTTUTGATTCCATTATAAAAGTTTGACACTTCTTT a909_ai2.seq

G A G G C C A TAT TCTGA AACAAAT-TTTACCTAC A ATTTCTTCTTCTGA A AC A Majority

10510 10520 10530 10540 10550



10489 G A G C C C A TATT-C-TCA AAC A A ATTTT A C CTU A ATTTCTTCTTrTn i ir 1 
I0485 GAGGCCATATTCTGAAAGAAATT T TACCTACAA^^^ 

TCTCCTACAGAAGTAtTTCGAGAATCAATT G AAGTTTTTCCGTTGTCTCG Majority 



10560 10580



S TCTC CTACACA AG T'-A^T ^,E^^^^^^^^ A ^^ T ^A A€TTTTTCGGTTGTC T C C cohl^.seq 
IC 1 CCTACAG A AGTATTTCGAGAATC A ATTG A AGTTTTTCGGTTGTCTCC a909_ai2.seq 

T A AT AC AAA AATTTTTT TATCACGT ACTTCAT ATCCCTATTTT AT ATT AC Majority 



10610 10620 10630 10640 10650



10589 T A A T A C AAAA.ATTTTTTT ATCAGGTACTTG AT ATGGGTATTTT ^ t a ■ t t* a I .o 

.o 5 8s.Tat* ^ AAAA*Tt^TTrT^Te»«uc^n:^l^?^^^^^^^^;;^;^J^ 

f ATTACCCAGT.CCTTT ATGAAT AA CAT ATCCTTr.TTTCAACTTATr.TTr.. Majority 



10660 10670 10680 10690 10700



T T C KGT.AAACAtC.CCCT TG.ACA AT CA ATATTAACCCACTCTC.CT.CACTC Majority
10710 10720 10730 10740 10750



10689. TT C A C G T A A AC AT CC C C T TCAGA ATG A AT ATTA A C C CAC^CTCrTrirTr m ,o 
10685 TTCACGTAAAPATP r r r -r *p #* _ _ i -t X ? 7 ~. 7 ^aa^^^AGTCTCCTGACTC cohl_ai2.seq 
»i.itUGTAAAC.ATCCCCTT6AGAATCAATATTUCCCACTCTCCTCACTC a909_ai2.seq 




10800



10739 TGCAAT.A ACCCGCTTGACTAGGACCTTATT ATTGTAGTAAAArrrr_r A A u, ,o 
10735 T'G C A A T A..A C C C G C T T G A C T A G GA C C T T A II ATTGTAGTAAAACG CD A C A A 2>OoZal2.seq 



CGTCTCCAGTTTTAAAATTTGAACCTTTTACTG T AAAGACTACATCACCT Majority 

10810 10820 10830 10840 10850



10789 C G T C TCCAGTTTTAAAATT.TGAACC TTTTACTGTAAACACTACATrArPT k. ,o 
10785 CCTCTCCACTTTTAAAATTtmcCTTTtlcT^ 

CC A CTTA A AGTCTTATTCA TTCA A TG TCCGTA.GATTCTTAATA CA G G C A A Majority. 



10860 10870 10880 10890 10900



10839 G GA. C f T.A.A.A GTCTTATTC A T J GA ATCTC C G T A,G At T C tt A ATACAGCPA A _^,» ,p 
10835 G C A r T T A A A.r vr , T*ra .. . - v J. • - - - . A -A A l A .%» A _ G.C A A cohl_al2-.seq 

• • T A A • G T ? T . - T* A X T G A A T G TGCGTA GA.T.T GTT A. A T A C Ai G G'C A A <i909_ai2;5eq 



C CA TA A A A CC G C A T T..A A A A7f CGCT.fi TT GAG G C A A C C G € CA T.jC A AG ATGT Majority

10910 10920 10930 10940 10950



wf£ C C A T A A A* r r~ rrtttT-r^* * ^ T C C ^ ^ T ^ ° ^ G c c ' A A c C G C C A T C A A G A T C T sea 
.10885 CCATAAAACCGCAATTAAAATG OCT CTTGAGGCAACCGCCATCAAGATGT IKlS 

AT:aTGGTATTTTTAATCACACTCCAAAAGCGTTT C TCATAACTTACGCGA Majority . 
10960 10970 10980 10990 11000

S A I A T G C I A T Tt t x FFI « A C X C T C C A A A A G, C G. T T T C T C A T A A C T T A C C C G A oohl^ia.seq 
.10935 * T A T G C T A T T T T T A A T G A C A C. T C C A A A.A G C G T T T C T G AT A A G T T A C C C G A aaOoTalZ ^ 



TCCAfeTTCTTCAGATAATTTATCTCAACT A ATCTGTCTTTTCATTGTCTA Majority 

11010 11020 11030 11040 11050



1098& TCCACTT CTTf J f it i i T * T'TGAACIAA T'C TGTCTTTTCATTGTCTA cohl al2.se,, 
10985 TCCAGTTCTTGAGATAATTTATCTGAACTAATCTGTCTT. TTCATTGTCTA a909_ai2.seq

Thursday, July 29, 2004 6:49 pm

c c g c pjjg^ ilKiii ^X ' p cttt ktaaatugtatctgca Majority

11060 11070 11080 11090 11100

J"* 3 ? ^ S CTCTCTT ATCTA C'T AAATTCT TTACGTTTTCTA.AATHC TATCTGCA." cohi ai2 see 
1I03S CCCCTCTCTTATeTACTAAAT.TCTTTACGTTTTCTAAATAGG TATCTGCA 

C C . -T AACTGAGCAGTCTCAA AAATACCACTAAGCATTAAGCAAGCTTCCCC Majority 
11110 11120 11130 11140 11150

11085 GCTAACTGAGCAGTCTCAAAAATACCACTAAGCATTAAGGAAGCTTCGGC a^Iail^ 

AATAGAACCAGCTTTGC T A ATTTTGATTTTTTT, ATCATCTAGCGCTTCTT Majority 

11160 11180 11190 11200

! 1139 A A TAGAACCAGCTTTGCTAATTTTGA T T TTTTTATC ATCTAGCGCTTCTT cohl ai2 sea 
11135 A AT AC A AC C ACCTTTGCTAATTTTC ATTTTTTTATC.ATCT AGCGCTTCTT alSlal^ 

TAACTTGCTGAATCTCTTT CTCTTGTTTTTCAATAAGAAGTTCCTGTTCT Majority 
11210 11220 11230 11240 11250

11159 TAAGTTGCTGAATCTCTTTCTCTTGTTTTTCAATAAGAAGTTGCTGTTCT cohl ai2 seo 
11185 TAAGTTGCTG'AATCTGTT TC TCTTGTTT T.T C A ATA AGA AG T"T GCTGTTCT al^Ia^ 

AACATAATTTCTACCAACT CTTTTCTTTTTA ATT TTTTTAA ATCTTCC AT Majority 
11260 11270 11280 11290 11300

1 1239 AACATAATTTCTAGCAAGTCTTTTCTTTTTAATTTTTTTAAATCTT CCA T cohl ai2 sea 
11255 AACATAATTTCTA-GCAAGTCTTTTCTTTTTAATTTTTTTAAATCTTCCAT a9^Ia^2^seq 

.CGCCATTACTTCCTTAACTG AACCTTAAATTATCGTTTACATATTATATC Majority 
11310 11320 11330 11340 11350

11289 CGCGATTACTT.CCTTAACTGAACCT.TAAATTATCGTTTAGATATTATATC cohl a!2 sed 
11285 CGCG AT.TACTTCCTTAAC T G AACCTTAAATTATCGTTTAG ATA T t A T A T C SotJ^s^ 

AAAGTTCTAACCTTTAA A CTCATTTT TT G TC C TCTG TTTTTTCTCA AAA A Majority 

11360 11370 11380 11390 11400

11339 A A A C T T C T AAC.C TTTAA.ACTCAT.TTTTTGTCCTGTGTTTTTTCTCAA"AAA cohl al2 sea 
11335 AAAGTTCTAACCTTTAAACTCA T*T TTTTGTCCTGTGTTTTTTCTC A A A A A aOO^l5l2^seq 

ACTCTATGCTAAATTAACA TTTTTGATAATT T.TT TGAAAAATCTCATCGA Majority 
11410 11420 11430 11440 11450

! 1389 AGTCTAT GCTA AA TTAAC AT.TTTTGAT-AATTTTTTGAAAAATCTCATCGA cohl ai2 sea 
11385 AGTCTAT GCTAAATTAACATTTTT GAT "A ATTTTTTGAAAAATCTCATCGA So£j£s^ 

AGT.CATTTTCTTTTTGAAA GCTCGAATTCTACGCATTAAAAAGCCATATA Majority 
11460 11470 11480 11490 11500

™ A ?rg ATTTTCTTTTTC A A ACCTCG.A ATTCTAC GC ATT A A 1 A A fiCC AJTTZ cohl al2.seq 
11435 AGTC ATTTTCTTTTTC A AAGCTCGAATTCT A GGCATTAA k A MBj— aQoCai^ seq 

T C.A A ATTGATATATGGCTT TTTTTATTATTTAAAACA.AAACAATCAATAG . Majority 

11510 11520 11530 11540 11550

\\ ^^^^^^^^^^^^^X^ T T A' t.tI A A A A C A A A A G A A T 'c a A t A r. ^„ seq 

G A 6 A AT A G C^f C A A'TTTAtT- Gk^f A A TC T A ^ t/AC A G A T T A AG T T C TTT.T Majority

1,539 £ A C A A T AG CG T C A A TTT AGTGAC AT A A T £ TAT T A C A GAT T A A GT tCTTT T cdhlai2 sea 
11492 G A C A A T AG cH T C A. A T T T' A G*'T 6 AG aQ A AT C TJl T T- A C A G A T T A A G.T T C T T TT .a909^i2^2i| - 

TGAATAAT A-T A ATCC AACT TTTCA ACTGT-TTTTTCC C ATCTGAAATGTTC Majority

11610 11620 11630 11640 11650

MfS I $ A A T A A T A J A A T C C A A C T t-T T C> AC T G T T T T T t C C GATGTG A -A A T G T T C cohl- ai2 seq 
1IS42 TGAATAATAf A ATCC AAC.Tf TTC.AACTGTTTTTTCCCAlHGTGA AATGTTC SS^l^ 

T T T A A T T C T T T T A G C A A T A T T C T G T T G T AG TTTC TCT CTTAATGCCTTAT Majority

11660 11670 11680 11690 11700

116 39 TTTAATTCTTTTACCAATATTCTGTTGT ACTTTCTCTCT-TA ATGCCTTaI cohl alk.seq 
11592 TTTAATTCTTTTACCAATATTCTGTTGT aB[t*T T CTCTCTT A ATGCCTTAT a909_ai2.seq

11689 CTTf TACTAATAAATCAACAGATT CATCCACTCACTCACTATTTTrTTrr cohl_ai2.seq
11643 CTTTTACTAATAAATCAACAGATTCATCCACTGACTCAGTATTTT C'T TCC a^wZal2!seq 



11739 ATCATCATTCCTAACTCAGGGCTATCAA TAACTTr A a r t r t t r r a ^ ^ 

U~ ATCATGATTCCTAACTCACGGCTATCAATAACTTCAAC^ a^09Zal2_seq 

ATCTCTTCCAATAATACCACT.CGAAAGTACA C CACCTTCTAAAATACAf, fi Majority

11810 11820 11830 11840 11850

U7 ^ ATCTGTTGCAA T A AT ACC ACTCGA A ACT AG ACTA f r t t r t'a a a _ t a ^ , ^ 
11742 ATCTGTTGCAATAATAroArxooV.*^i.^?!: GCTTCTAAAATAGAG G Cohl_alZ.seq. 
AiUtTTGCAATAATAGCACTCGAAAGTACACCACCTTCTAAAATAGA.ee a909__al2.seq 

TTGCTAATCCCTCTCGATA.CATTGA.TGGGT A AACAAACATATCTCTCTGT. Majority

11860 11870 11880 11890 11900




11950 



11889 CCCATTAAAGACATACTCTGTTCAAAGT TTAATTTCCCCAAAAAGTTAAT ™ut 
11842 GCCATTAAAGATATA CTnfr-T'rr* * _ * * *-v*i^caaaaagt. TAAT cohl_ai2.seq 

^vt.tAllAAAGACATAGTCTCT.TCAAAGTTTAATTTCCCCAAAAAGTTAAT a909_al2.seq 



^ T ° T T T GGACTGATATTTCTCTTTCAAATGT- C CTAATTCAGGTCCGTCTC Majority 

11960 11970 11980 11990 12000



11939 C T C TTT GG ACTGATATTTCTCTT T-C AAA TCTGCTA AT TCACCTCcrTrTr ut 
.11892 CTGTTTG*GACTGATA T.T TCTCTTTC AAATfiTCC T A A T'T C A G G T C C G T C" T C a^9^ai^!seq 



C-T G C A A T C T G T A A A T A A AC A T T T T C A G A G T A C T C-T G A C A T C G A A A A T G C T Majority 

12010 12020 12030 12040 12050



11989 CTCCAATCTGTAAATAAACATTTTCAGA CTACTCTGACATrfA A a a TrfT u< ^ 
^-CTGCAA-TCTGT.AAATAAACAT.TTTOAGAG?AC 

TCTAAGAGCAATTCAATGCCTTTTTCTTTAA T AATTCTACCAGCATAAGT Majority

12060 12070 12080 12090 12100



12039 X C X A A G A GCAATXCAATCCCXXXTXCX XX A A TAATXCXACCAc'pata apt ™k. ■ -> 
11992 T C T A A G A G C A A X X C A A X G C C X X I X X .C X X X A A-X A A T X C X ^ C-C AGCATAACT 2o9_al2]seii 



GATCAAAATATCATCAGCACATTTTTCAAG G .TAAGCCr.TCTCA«;nA A A AT Majority




C A G A-G.C CTAGACT. TTCAGATA.CCGAA TT AT A A A T A A CT CCTTTAGCTT g^T Majority

12160 12180 12190 12200

12139/C A G AflC CT A G A C-T T T C A G A T AC CG A ATT ATA A A T A AC Tr rTTTAr^-r-rrl » « _o 
12^92. e A G A^G C t A:G A C XT T € A G A X G G AA T I A* A 5 At A A U MS J f „ c C X 

. AT A T T A A A A TGT-t T T A A C C AT T-C A. ACG C .T T C "T/C't TC G'At -A' CC- G C AT A A A . A Majority 



12210 12220 12230 12240 12250

S txATXAAlAXGXXXXA^ cohi.aiiseq 
12142 * T A T.X A A A A X G X X X X A A C C A X T C A A C C. C X X C T C X X G G A T A C £ G C A X A A A A a9b9_ai2.s«, 

A X.C TCC A C G A X A G X G C f X AACACCCGC X G.X G A G A A C A T G XX CAXAG AXAC Majority

12260 12270 12280 12290 12300

S ^T^C^cl-AT^p'pnMp^ cch._al2.se, 
12192 -A T C T G G A C G A X A G X G C X X A A C. A C G C G C X G X. G A G A A G A T G X X C A T A G A X A C a909_al2.seq 

T C C A A AG A A A X C X A A A A A AC G A X t A X X C A C A G A A A A A X C A CX X G A C C C. A Majority

12310 12320 12330 12340 12350

!__4_ CTrrAAAOAl t A^^M^M CCATTATTG * CACA AAA coM_ a i2. seq 
12242 CTCCAAAGAAAXCXAAAAAACGAIXAXXGACAGAAAAAXGACXXGACCCA a909_ai2.seq

T C G T C IU Y CA, A T,„A .P^C G j^ffi p ■Q-jfi.T.tG CTTCTT TCCA AAAGATACCCC Majority
12360 12370 12380 12390 12400



12292 TCCTCTAAAACAATACTAGGTAAACCCTGCTTCTTTGCAAA.AGATAGCCC aSO^li^ 
TTCTAGCGTTGTTA ACTG AAAACCTGTATTACAAATCACAAAATCAATAT Majority 

12410 12420 12430 12440 12450



12342 TTCT A GCCTTGTTAArTr^a^»^ G ^ C ^ A ^ cohLa.z.sea 
12342 TTCTAGCGTTGTTAACTGAAAACGTGTATTACAAATCACAAAATCAATAT a909_at2.seq 



orlty 



1^2 TTTC ATCTGAAAP AToIIt^^^^ ^ ^ ^^ ^^^^^^^^CATTTTTCTtA cohl ai2.seq 

12392 TTTC ATC T G A AACATGTTTCATCAGCGTGTTGTA T T C T C G A TTTTTCTTA aSOsIalZ^ 

ATAATAGGATAG CGCT GC TTG AC AATCTTT T T GGTCGGTAAACGGTAAAT Majority 

12510 12520 12530 12540 12550



A T A A T A GGATArcrrTrrT*^!^^^ ^ AT^T TTTTGGTCGGTAA A C C CT A A AT cohl_ai2, S eq 
12442 -A TAATAGGATAGCGCTGCTTGAC A ATGTTTTT GGTCGGTAAACGGTAAAT aSOoIalZ.seq 

TT.TTCTACCCTTGTCTTCATCTAT A ATCGGTAAATCATCATGATTAGTTC Majority 
12560 12570 12580 12590 12600

1^ TTTTCTACCC T T CTCTTPiiTrT^^^^^^^ ^^^^^^^ CATCATTAGTTG cohi_ai2.seq 
12492 TTTTCTACCCTTGTCTTCATCTATAATCGGT.AAATC aQ CATCATTAGTTG a909_ai2.se!, 

TT A C AATAACAACACGGTAGCCA C GCTTAACC AAA T C T G C T G T C A TT T T A- Majority 



12610 12620 12630 12640 12650



Ht£ T T A C A AT A A C A A C A C r c I~a r r ^ ^ CTTAA.CCA AAT CTC CTGTC. ATTTTA cohl__ai2:seq 
12542 TTACA-ATAACAA CACGGTAGCCACGCTTAACCAAATCTGCTGTCATTTTA a90.9_ai2.seq 

- TCrCTATAACGTTCAATACCTC-CCACCAA C CG TAG A T A A T A TCC T G A G A A Majority 

12660 12670 12680 12690 12700



S £ CTGTATAACGTTrAAT cohi.ai2.seq 
12592 TCTGTATAACG.TTCAATACCTCCGAGGAAGGGTAGATAATATCCTGAGAA a909_ai2.seq 

AACAGCAACTG TTTTtAC CTTATTTTCCATATTTATCCACTTTCATCAAT Majority 
12710 12720 12730 12740 12750

1^642 AAC AC C A A pI'ptIII'II^I^ ^ TT-A TTT TC C ATATTT A.T C C,A C.TTTC A TC A At cohi_al2.seq 
12642 AACAGCAACTGTTTTTACCTTATTTTCCATATTTATCCACTTTCATCAAT a909~ai2 se^ 



AAGCCATCTTT TAAGCC TTTAATCATAGCAACTAATTTTTTGCtCTTTTG Majority 

12760 12770 12780 12790 12800



12692 X A G GC ATC I T 11 A 1 r r > II H * T ° * T * ° ° 1 * G T A A T T T T T T G C T C T T T T G cOhl^.seq 
12692 A A G C C ATCTTTTA A.CC 0TIT AATCAT ACC A ACT A ATTTTT IGCTC tTTTC a909_ai2.seq 



C T C T T C T C C T A C C A A C A C J C G A A C A A A T T C A T T T G G C ATA A A T A C T A A A T Majority

12810 12820 12830 12840 12850



-£S CTCTTCTGC 5-X S P A A > f S£ ^ A O A A A t .t C A T T T C .G C A T A A A T A C T A A jA. T cohl^ia.seq 
12742 CTC T.T.C T G C T A C C -A A C AC TC G AAC A. A AT T C A T T TC G.C AT A A AT A CI A k A T/igOS^.^ 

A t t :t C,* GCCGCtTCT T C TT AC CA T.A.T T T TT t T A T- A-' A t A T A A A TC G C A -T T G Majority
12860 12870 12880 12890 12900

S AT TT C ? G C C t C rl c 1 1 H I * 55 ^ I ^ T TT > ^ T T A T A- A T A t i A A T C G C A fV^ cohl.al2.seq 
12792 ATTTGTGCCGCTTCTTCTTACCATATTT.TTTTA.TAAT ATA A A T C G C A T.T G .a909_.ai2.ieq 

C.G T ATC ATGTA AT AT TTT C.G A- A AT GGTGAATGA TT C A A T A C A T G A A A A-A C Majority 
12910 12920 12930 12940 12950

™% C G T AT C AT CTA AT ATTTrrr a "a A T G G T G A A t g a t t C.A A T A C A T G.A A A A A C cohUai2:seq 
12842 CGTATCATGTAATATTTTCGAAATGGTGAATGATTCAATACATGAAAAAC a909_ai2 .seq 



: A T G G C C A A A T T T f T T A A C T C G T C A A G A G T G T C C A A f T T C G T G T A A A A G A 6 Majority 

12960 12970 12980 12990 13000



S ^^^M^ TTTTTTAACTCGTCAAGACT GTCCAATTTCGTGTAAAAGACcoM a l2 . S eo 
12892 ATGGCCAAATTTTTTAACTCGTCAAGAGTGTCCAATTTCGTGTAAAAGAC agoalalZ.I^ 



TTCATTTCAGAGTCAACAAAATCAATAAACAT CTCTTCTGCAAAGCCACA Majority

13060 13070 13080 13090 13100



TCTTTCTTCAAAAACGCTCCTTTTCATTAA A GCAGCCGAAGTAATACACT Majority 



13110 13120 13130 13140 13150



CTTC AATTKHTATACTCAAATTCTTCCATCAn T AAATCTTCACCCTTn Majority 

13160 13170 13180 13190 13200

13139 CTTCAATTTCTTTATACtCAAATTCTTGCA tBa CT A A ATCTTf Af Pf TTf ,o 
13092 CTTCAATTTCTTTATAGTCAAATTCTTGCA TC A CTAAATCTTCAQGGTTC a9^Zal2^seq 

ATATCTTGATACAAACAAGATAACATACCf. ACCTTAGCTAAATCAAr.r.T| Majority 

13210 13220 13230 13240 13250



13189 -ATAT. CTTGATACAAACAAGATAACATACC GACCTTAGGTAA aWt A « r p t 1 ... 
13142 ATATCTTGATACAA AC AA-GATAA.CATACCGACCTTAGGTAA AT G' A A G G"T A -aslwljaj^seq 

ATTTTCATAATTATCTATCAAATCACCTACG A CAACCCAATCTTCATnTA Majority 

13260 13270 13280 13290 13300



AACTCAAGAACCAATCAAATTCTTCTGCTAC T -G-C AAATTGAC CG AT'Af. AG Majority

13310 13320 13330 13340 13350

TTCAAA.GCATA T C.C AATTCCTTTATTTTCT G' T T A A ATAATC A ACAGTTAfl Majority 



13360 13370 13380 13390 13400

13339 TTCAAAGCAT ATGCA AT TCCTTTATTTTPTPTT A * a t a a t» r» 4 a ^ a ~ - , 1 
13292 TTCAAAGGATATGCAATTCCTTTATtTTC T.G TT^AAATAATCA AC AGTTAG a909^12!seq 

GTGCCCCTCTTCATT ATAA TCGGCCACTAATTGAGAAATTTCTTCCTTAT Majority 



13410 13420 13430 13440 13450



13389 CTGCCCCTCTTCATTATAATCGGC CACTAATTGAGAAATtTCTTrrTTA-i- K, ,, 
13342 CTGCCCCTCTTGATTATAATCGGCCACTAATTGAGAAATTT cQt CCIT AT a909Zal2*seq 




13550

A. A Majority 




QC t G A A T G A C T G G T G 0 T T T G G T T A T A A A A A C. g A T A C C C A C A T A G A T A G T T- A Majority

13610 13620 13630 13640 13650



13542 CTCAATCACTCGTGCTTTGGTTATAAAAACGATACCCACATAGATAGTTA a909_ai2.seq

13660 13670 13680 13700



13639 CTGCTACT AAACTTTGAATCAC ATAATTTACCAATGATACTCACATTT G A cohl al2 sea 
13592 CTGCTACTAAACTTTGAATG AC ATAATTTACCAATGATACTCACAfTTCA SS^Sl^ 

GTATTGATATA ATA GAGTACAGCTCCACTAAGAGTACCAGCAATTAAATA Majority

13710 13720 13730 13740 13750



1"*642 C T A T T r a ? 1 i M i Wt* TACAGCTCCAC TAAGAGTAGCAGCAAT T A A A T A cohl_ai2_seq 
13642 CTATTGATATAATAGAGTACAGCTCCACTAAGAGTAGCAGCAATTAAATA a909_ai2. S eq 

GCCCAGCATTCCfCTTGTTA A TTCTTTAAAAGTAAATACATCTCTTAAAG- Majority

13760 13770 13780 13790 13800



S rr^ A rrH T C C T C T T G T T A A T T C T T T A A A A G T A A A T A C A T C T C T t A A A G* cohl ai2.se*, . 
13692 GCGCAGC ATTCCTCTTGTTAATTCTTTAAAAGTAA A.T AC ATCTCTTA'AAG a909~al2.s^ 



AGATAGCTTGATATAGGGAGA CAATAAATTCACTAATAACTGTAGAGATA Majority 



13810 13820 13830 13840 13850



13789 AGATAGCTTG AT A0A G G C A G A C A. A T A A A T TC A € T A AT A A C T G T A G A G A T A colli ai2 sea 

13742 agatagcttgatatagBgagacaataaattcagtaataactgtagagata SS^iSJ 



ATAGCTCC C AT AGC ACCTAAAATTGGTATTAAA AGTATATTAAGCAGAAC Majority

13860 13870 13880 13890 13900



"'"{"•JJfS A TACCACCTAAAATTGGTATTAAAAGTATATTAAGCACAAC oohi_al2.seq 
13792 ATAGCTCCCATAGCACCTAAAATTGGTATTAAAAGTATATTAAGCACAAC a909_ai2.seq 

ATTTCCCACAAGTCC A AT A ACTCC AGACATTGTGTA ACCTTTTGTAC^C Majority 
13910 13920 13930 13940 13950

\f™ llZlcrrlir A A G A A I A f C T G C A G A C A ^ T G T G T A A G C T T T T G T A C G T C cdhl^a^seq 

13842 ATTTGCCACAAGTCCAATAACTGCAGACATTGTGTAAGCTTT T G T ACGTC a909_al2,seq 

TTG AAGCCAGTAGATACTG TGTCC€TAAAGCGTTACCATAAGAA A T G C A A Majority

13960 13970 13980 13990 14000



13939 TTGAA G CC A G TAG AT A C T G T G TC C C T A A AG C G TT A C C A T A AU A ATGC A A cohlalZ sea 
13892 TTG AAGCCA.GTAG AT ACTGTGT.C CCTA A AGCCTTACGATAAG A AATG C : A A So9_~al2!seq 



ATGATCATCAAA Majority
14010
13989 ATGATCATCAAA cohl_ai2.seq
13942 ATGATCATCAAA a909_ai2.seq



Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.



MKLSKKLLFSAAVLTMVAGSTVEPVAQFATGMSIVRAAEVSQERPAKTTV Majority



- — 1 1 - - -J : L 



51 
il 
>1 

>i 
si 



OMJ UP ImZ f« STVEPVA< J FA TG«SIVRAAEVSQERPAKTTV sag64S_2603.pe P 

ur^rJ^rct^I^ H VAtSIVEfViQFiTCHSIVRAAEVSQEIPAKTTV sa^S a909 Lp 

«K^^K^, ^M«^^y r AGSTVEPVAQFATGMSIVRAAEVS< i ERPAKTTV sagMsIcohl.pep 
M * L S K K L L F S A A V L T M V A C S T V E P V A Q F A T C M S I V R A A E V S Q E R P A K T T V sa^Sj.emSie^ep 

NIYKLQADSYKSEITSNGG 1 ENKPCEVISWYAKLGDMVKGLQGV0FKRYK Majority - 

f° ? *2 90 100 

K^Kj"n A ^v^^H^ CIENKDGEVISKYAKLC,) ' ,VKGL « GV « FKRYK sag645_2603.pep 
NIYKLQADSYKSEITSNGGIEKKDCEVISNYAKLGDNVKGLQ C V OF K R Y K sae645 a909 oeo 

1 YKL4JADSYKSEITSNGG I ENKDGEV ISNYAKLGDNVKGLOCVQFKRYK saG645 cohl oeo 
NIYKLQADSYKSEITSNGGIENKDGEVISHYAKLG0KVKGLQCVQFKRYK iStpep 



VKTDISVDE 



IKKLTTVEAADAKVCTIL EECVSLPQKTKA^CIVYDALDSK Majority 
l }° i?0 130 140 TsO 

1 — 1 — : U s L 



51 
51 

51 



01 V-KTD ISVDELKKLTT V E A A D A K V G TI L E E G VS L PQ KT N A Q G L VV D A L D S K sa«645 2603 pep 
o! V^^lv^™^™^ 

o\™^jfv£*i**^™M^ sa^cj-burpep 

^^^I^fX LKKLTTVEAADAKVGTILEEGVSLP Q KTNA QGLVVDALDSK sag645 cohl pen 
01 -V K -T.P I SYDELKKLTTVEAADAKVGT ILEEGVSLPQKTNAQGLVVB'ALDS K Ssl^i^ep 

S N VR Y L Y V E P L KNS P S N I T K A Y A V P F V L E L P V AN S T G T.G F L S E I N I Y P K N Majority 
l f° 'T° 180 190 200 

t \ 1mv^^v^S^^o^J T ^ AYAVPFVLE LP VANSTCfGF LSE IKIYPkI sag6^2603.pep 
51 SN VRYLYVEDl K.N.S PSN I T K ATA VP FVLELP YA RSTGTG'FLSE INI YP K-N safi645 a909 nen 
*i SNVRYLYVEDLKNSPSN I TKAYAVPF V.L ELP VANSTCTCFLSEIK IYPKN ^S^Sif^ 

S N V R Y L Y V.E D L K-N S P S N I T K A Y A V P F V L E L P V A N S T G T G F L S-E I N I Y P K N S coh 
S KVKYLYYEDLKHSPSN I TKA'YA VP FVLELP V-A ir.S.TGT6F L'S E I N I YP KM SSl^p 

VVTPEPKTPKPVKKLGQPPA GYT I G EEF K WFLK ST I P A N-L G D YEKFE I T D Majority 

■* * ^ 1 ~ - — : — I - I J J 

■ 2 }° 220 230 . 240 : 250 

VVTDEP. KTBKDVK.KLGQDDAGYT I GEBFKWFLKSTIPANLCD YEKFE I T D sae64S 2603 oeo 
VVTJJEPKTDKDVKKL.GQDDAGYTIGEEFK. WFLKST1PANLGDYEKFEITD SS^aSol' ^ 
Ol.VVTBEPKTDKDVKKLGQDDAGYTICEEFKWFLKSTIPAMLGDYEKFEITO t^ttfmiTveo 

VVTDEPKTDKDVKKLGQDDACYTIGEEFKWFLKST1PANLGDYE.KFEITD sa^S^iltp^ 

KFADGLTYKSVCKIKJGS KTLHRDEBYTIDEPTVPHQNTLKITFKPEKFK Majority 
■ ..' " 260 270 . 280 290 300 

KFADGLTYKSVCKIKIGSKTLMRDEHYTIDEPTVDNQMTLKITFKPEKFK sag645 2603 pep 
^n^rf-Jvr SVGXrKICSKTLHRDEHyT.rDEPTVDHQNTL.KrTFKPEKFK ^S^OO.'^ 
KF ADC LI Y KSyCKIKICSKTLNRDEHYTIDEPTVDNQNTLKITFKPEKFK safeVciblll oeo 
KFADGLT.YK.SVGKIKI CUT L'N R D E H Y-T IDEPTVDHQNTLK I T F K P E K F JK SSls Shi peT 
il K F A D G i T Y K S V G K I K I G S.K T L N R D E H Y T I B E P T V D H Q N T L JC I T F K P. i K SgiS^ieXp 

,E I A E-L. L K G HTIVIN Q P A>L P K AT AN T P PA AF L EI P.V AST I ft Jb K A V L jj A I E Waioritiy 
■ . 3 ?° 3 ?° r . ,330, 346 . 350. : 

p Mfl ilnul? I r 2 « » A VDK ATAN T PD A A F LB t JP V AST I k E K A V t GK A. I E sag645j 2 603.pep - 
» I - ^ " K 5 M T L V K S Q ^ A L D K A T A N T D - D A AFLE I P VAST XNEKA V.LC.K A I B ^45 a909.pep 
n t A ^J'^J MT ^^ K ^^ DA ^ DKATAM ^' DD -A-AFLEi P VAST I NEKAVLCK All sa^45 ^bl I l!Lp 
11 tl l* U T L V K N Q D A L D K A T A N T P P A A F-L E I P V A S T I N E K A V L G K A 1 E sa^is'Shl ^ 
E I A E L LKGMTL VK N. Q DALDKATANTDDAAFLE IPVASTJHEKAVLGKA I E 2^45^ ^ep 



Di 
01 
01 
Di 
01 



>1 

si 

il 
il 



)1 
)1 
)1 

u 
)1 



,N T F E L Q Y P H T P P K AP N P K P S N P P R K P E V H T C G K R. F V KK P S T E T 0 TU G A E Majority 

. 360 . 370 . 380 -39 0 .. .". ,. : ' .'.400 

I! ™I FE ^^ D " TPDKADKPKp S N PPKKPEVHTGCKKFVKKDSTETQTLG.GAE sag645 2603.pep 
dt^aISJII!! ^ A ^ " P ^ p ? NPPHKPEVHTGGKRFVKKDSTE T Q ILC'GAE sa^s'agog 

■J oc^ I P E ^ " ^ *»® KRFVKKDSTETQT L G G A E sag645 ooKl.pep 

.1NTFE.LQYDHTPDKADHPKPSNPP.RKPEVHTGGKRFVKKDSTETQTLGGAE ^45^^^ 



Paget 
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101 
101 

iOl 

toi 

IOl 



151 
151 
151 
51 
51 



Ol 
01 
01 
01 
01 



• ' i. It" ' " ' "I' 1 ' 1 " f* 1 — J4 — ?l i?""' ') '"'' , ■' t' '""it 




FDLLASDCTAVKWTDAL 
F • D L L A S D GTAVKWTDAL 
FDLLASDCTAVKWTDAL 
FDLLASDGTAVKWTDAL 
FDLLASDGTAVKWTDAL 



KANTNKNY 
KANTNKNY 
KANTNKNY 
KANTNKNY 



I KANTNKNY 



I A C E A VTCQP 
IAGEAVTGQP 
IAGEAVTGQP 
IAGEAVTGQP 
IAGEAVTGQP 



I KLKSHTD 
I KLKSHTD 

I K L K S HT D 

I KLKSHTD 

1 KLKSHTD 



G T F E I K sag645_2603.pep 

G T F E I K sag645„a909.pep 

G T F E I K sag645_cj bill. pep 

G T F E I K sag645_cohl.pep 

G T F E I K sag645_nem316.pep 



G L A Y A V D A N A E GTAV T Y K L K ETKAPE C Y V I 

- r 



— i — 
460 



r 

470 



L AY A V D A NA EGTAVT Y K L K ETKAP 



480 
i_ 



PDKE IEFTVSQTSYNTKPTD 

' i 



Majority 



490 
i 



G L A Y A 

G L A Y A 

G L A Y A 

G L A Y A 



I T V D 



V D 

V D 

V D 

V D 



ANAEGT.AVTYKLJCETKAPEGYVI 
ANAECTAVTYKLKETKAPEG Y V I 
ANAEGTAVTYKLKETKAPEGYVI 
ANAEGTAVTYKLKETKAPEGYVIPDKE 



EGYVI PDKE IEFTVSQTSY- 
PDKE IEFTVSQTSY 
PDKE IEFTVSQTSY 
PDKE IEFTVSQTSY 
IEFTVSQTSY 



500 




N t K P T D sag645_2603.pep 
NTKPTD sag645_a909.pep 
N T K P T D sag64S_jcj bill. pep 
NTKPTD. sag645_joohl.pep 
N T K P T D sag645_jnem3I6.pep 



KCKKRR Majority 



I T V D 

I T V D 

I T V D 

I T V D 

I TV D 

T K D N 



S 
S 
S 
S 
S 



A D 

A D 

A D 

A D 

A D 



A T P D T 
A T P D T 
ATP pT 
ATPBT 
A T P D T 



KNNKRPS I 
KNNKRPS I 
KNNKRPSI 
K N.N K R P S I 
KNNKRPS I 



P N T G G 
P N T G G 
PNTCC 
P N. T C G 
PNTCC 



G T A 
G T A 
G T A 
G T A 
G T A 



I 
I 
I 
I 
I 



F V A 

F V A 

F V A 

F V A 

F V A 



ICAAVMAFAV 
ICAAVMAFAV 
ICAA.VMAFAV 
IGAAVMAFAV 
ICAAVMAFAV 



K G ftt K R R-sag645_2603.pep 
K G M K R R sag6 45_a909. pep 
K G M K R R sag645_cj bill, pep 
K G M K R R -sag645_cqhl .pep 
K G M K R R sag645_nen316.pep 

Majority 



Page 2 



SI T K D N 

51 T K D N 

51 T K D N 

>I- TKDH 

51 T K D N 



BGoration • Decoration #1': Shade (with solid black) residues that differ from the Consensus 



sag645__2603 . pep 
sag645_a909. pep 
sag645^cj bill. pep 
sag645_coh 1 .pep 
sag645_neo3 16 . pep 



- 



' • • J- 
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U K K R 
WO 2006/078318 PCT/US2005/027239

[Figure: amino acid sequence alignment of sag649_2603.pep, sag649_coh1.pep, sag649_cjb111.pep, sag649_18rs21.pep, sag649_nem316.pep and sag649_cjb111.pep against the Majority (consensus) sequence, ruler positions 1 to 890, printed over three pages. Partially legible alignment report header: "... method with PAM250 residue weight table." Thursday, July 29, 2004. Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.]

[Figure: amino acid sequence alignment of sag1408_2603.pep, sag1408_515.pep, sag1408_18rs21.seq, sag1408_cjb111.pep, sag1408_h36b.pep and sag1408_nem316.pep against the Majority (consensus) sequence, ruler positions 1 to 900 and beyond. Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.]
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